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A novel covariant formalism for the treatment of the transfer and Compton scattering of partially 
polarized light is presented. This was initially developed to aid in the computation of relativistic 
corrections to the polarization generated by the Sunyaev-Zeldovich effect (demonstrated in a com- 
panion paper), but it is of more general utility. In this approach, the polarization state of a light 
beam is described by a tensor constructed from the time average of quadratic products of the electric 
field components in a local observer frame. This leads naturally to a covariant description which is 
ideal for calculations involving the boosting of polarized light beams between Lorentz frames, and is 
' more flexible than the traditional Stokes parameter approach in which a separate set of polarization 

^ basis vectors is required for each photon. The covariant kinetic equation for Compton scattering of 

partially polarized light by relativistic electrons is obtained in the tensor formalism by a heuristic 
Jr^ ' semi-classical line of reasoning. The kinetic equation is derived flrst in the electron rest frame in the 

Q I Thomson limit, and then is generalized to account for electron recoil and allow for scattering from 

, an arbitrary distribution of electrons. 
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PACS numbers: 



The theory of transfer and scattering of polarized ligh t is fundamental to astrophysics and cosmology, and there 

Q, El in El mill a El SeI. 



> 

^ I is an extensive literature dealing with the subject lUl, [Ij, [Ij, |15|, [Ij, |2j, |25|, |2g|, |47l. Most treatments write the 
. transfer equation using the four Stokes parameters /, Q, U, V , which provide a complete description of the radiation 
■ field (there also exist several other approaches to the description of the polarization properties of radiation fields, for 
, example the Jones calculus, Mueller matrices, and coherency matrices; see [43|). These parameters have dimensions of 
' specific intensity, and are functions of time, photon propagation direction, and frequency. In the case of unpolarized 
] photons, a complete description of the radiation field is given by the total specific intensity Stokes parameter J,^, 
f| ■ or equivalently the phase space density of photons. The Stokes parameters are defined with respect to a set of two 
Q-i' orthonormal polarization basis vectors, normal to the photon direction, which must be specified for each possible 
Q photon direction. 

$^ ' However, in Comptonization calculations with polarized photons, this Stokes parameter formalism becomes very 
, cumbersome — the elegance of reducing the description of the radiation field to four functions is achieved at the 
expense of a very complicated transfer equation. Compton scattering involves a relativistic scattering electron in 
' general, and the complete transfer equation involves Lorentz transformation of the Stokes parameters, which is 
• ' somewhat complicated. To get around this difficulty, in the context of computing relativistic corrections to the 
/\ ' polarization generated by the Sunyaev-Zeldovich effect, we found it convenient and illuminating to introduce a novel 
^ . formalism for doin g ra diative transfer calculations with polarized photons, the polarization tensor formalism. In a 
■ - - ' companion paper ('3 7*1, hereafter referred to as Paper II) we apply this formalism to a calculation of the polarization 
generated in the Sunyaev-Zeldovich effect. While the new description is superficially a little more complicated than 
the Stokes parameter approach, it leads to a form of transfer equation which is easier to manipulate and ideal for 
calculations involving the boosting of polarized light beams between Lorentz frames. In this paper we will restrict 
discussion to Compton scattering, but our methods could be extended to other scattering processes without great 
difficulty. 

Our approach is closest in spirit to the "coherency matrix" formalism introduced by |47j (a review of this, as a lead 
in to our formalism, is given in fjjl . The basic idea is that since the Stokes parameters are essentially time averages of 
certain quadratic products of the electric field components of the electromagnetic field (as measured in a local observer 
frame) , by expressing the electric fields in terms of the Maxwell field strength tensor components this leads naturally 
to a classical formalism in which the four Stokes parameters are replaced by a two index complex Hermitian tensor 
. The trace of I^'^ is the usual total intensity Stokes parameter. A tensor analogue of the phase space distribution 
function, f^^, is also easily defined. These objects are collectively termed the polarization tensor (or matrix). A 
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similar formalism has been developed by Challinor 12], but he did not provide a clear physically motivated derivation 
of the form of the Boltzmann equation for Compton scattering, which we provide here. 

In this matrix formalism (in flat space) we associate a 3 x 3 matrix with each photon, rather than a set of polarization 
basis vectors and the associated Stokes parameters. For example, a beam of partially-polarized light travelling in the 
z-direction is described by the Hermitian matrix (termed the polarization matrix): 



where I,Q,U,V are the Stokes parameters (with units of specific intensity). The trace of the matrix is the total 
beam intensity. For a general photon direction n, the beam is described by a matrix I{n), and transversality of the 
polarization implies n^Iij = 0. In general, the polarization matrix is a function of photon frequency and direction as 
well as spatial position and time, I = /(;/, n, x, t). 

The real advantage of this description is that there is no need to perform a complicated rotation of axes when 
examining photons with different direction vectors (in the angular integrations needed in the radiative transfer equation 
for example). In addition, it is simple to extend the 3x3 matrix description to a 4 x 4 manifestly covariant tensor 
description in which Lorentz transformation of polarized beams between frames is easy. In the matrix approach, 
the radiative transfer equation for scattering of polarized radiation is much more straightforward than in the Stokes 
approach. There is no need for rotation of axes to define separate Stokes parameters for the incoming and outgoing 
beams. Both are described by a single polarization matrix. The transfer equation for Thomson scattering is elegantly 
expressed in terms of a set of projection matrices P{ng) which project out of the matrix I{n) the component of 
polarization orthogonal to n<,. By contrast, when using Stokes parameters one has a complicated angular integral 
involving rotation matrices with Euler angles [H 24]. 

We now outline the structure of the paper. In fj] the Stokes and coherency matrix formalisms are reviewed. In iflll 
this notion is generalized and our tensor description of polarized light described, first in a non-covariant manner. The 
covariant formalism is introduced in ijllll and we discuss the properties of the polarization tensors, their evolution 
in the absence of scattering and in the geometrical optics limit, and their relation to the Stokes description. In 
^IIVI the behaviour of the polarization tensor under Lorentz transformation is discussed, and an explicit example 
of the computation of the polarization of a boosted beam presented. In f0 the classical non-relativistic physics of 
the generation of polarization by Thomson scattering in the electron rest frame is discussed using the polarization 
tensor approach. An equation for the time evolution of the distribution function polarization tensor in the electron 
rest frame due to Thomson scattering is derived. Then in ijVIl we derive the Boltzmann collision integral using a 
phenomenological approach based on the master equation of kinetic theory, still in the Thomson limit. As a check, 
we construct the matrix analogue of the radiative transfer equation in the case of a scattering medium composed of 
stationary electrons, which agrees with the results of In ^VIH the full relativistic kinetic equation is obtained, 
working in the rest frame of the initial electron - following the procedure used in the Thomson limit, but using the 
Klein-Nishina cross section and taking into account recoil. The transformation to a common lab frame is then taken, 
to obtain the kinetic equation for scattering from electrons with a general distribution of velocities. We check that 
this can be expressed in a manifestly covariant form. 

Note that throughout the paper, boldface quantities, e.g. p, denote 3-vectors, and quantities with vector arrows, 
e.g. p, denote 4-vectors. The indices of 3-vectors and tensors are denoted with Roman indices, and those of 4-vectors 
and tensors with Greek indices. Both 3x3 and 4x4 matrices are denoted with boldface quantities. Note also that 
3? denotes the set of real numbers, C denotes the set of complex numbers, and 3?e denotes the operation of taking the 
real part. 



The classical description of partially polarized light uses the well known Stokes parameters, which are defined 
operationally in terms of experiments with polarizing plates (the most c omp lete treatment of the Stokes parameter 
formalism for polarized radiative transfer is contained in the monograph [lj|). Physically the Stokes parameters can 
be thought of as time averages of instantaneous products of electric field components. There is a close relationship 
between the Stokes parameters and the notion of the coherence of the two photon polarization states, which is 
described mathematically by the coherency matrix introduced by based on the work of Additional work 
was done by to extend the concept to a spectral coherency matrix. It is worthwhile reviewing the notion of the 
coherency matrix, since this leads naturally to the polarization tensor description. 

We will only consider electromagnetic fields which are superpositions of plane electromagnetic waves. An idealized 
superposition of such waves whose wave- vectors are all perfectly aligned will be termed a beam. Consider first a beam 




(1) 



I. THE COHERENCY MATRIX 



3 



propagating along the z-axis. The transverse electric field components at a specified fixed spatial point {x,y,z) are 
real functions of time, E^\t)^ Ey'\t). These functions can be expressed as a superposition of an infinite number of 
monochromatic waves with arbitrary phases, i.e. as Fourier transforms 



We have assumed here of course that the Fourier transform exists - which is not true for all functions Ex {t) , Ey {t) , 
but we will gloss over this point (the existence of the Fourier transform can be assured without difficulty by working 

with functions which are truncated as t — > zLoo. See for example 0). In order to ensure reality of Ej^\t), the Fourier 

transforms must satisfy Ej{—Lj) = E*{u}). Now we split the integral above into two parts: 

Ef\t) = E,{u^)e-^-'dLu + -j=j^^E,{u)e-^-'doj 



= ^{E,{t)+E;{t))^neE,{t) , (3) 
where we have defined the complex functions Ej{t), conventionally called the analytic signal 0] associated with 

E,{t) = -= / i?,(a;)e— . E,{t) e C (4) 

V ^TT Jo 

We may decompose Ej{uj) uniquely into a real amplitude and complex phase factor: 

£:,H=a,Me'*^(-) . a„0, eSR (5) 

The analytic signal is thus 

2 r°° 

E,{t) = ^ a,(a;)e*'^^(")-*"*dw . (6) 



27r 

How are all of these quantities related to what is measured by a real polarimeter? Generally speaking, polarimeters 
measure the time average of the intensity of the light beam at a fixed spatial point after it has traveled through a 
combination of filters (See e.g. for a good general discussion of astronomical polarimetry). The two basic filter 
elements required to measure the polarization state are a polarizing plate, and a compensator l43 . We shall describe 
how the time average is constructed from the quantities we have defined, and then consider the effect of the two types 
of filter on the beam. 

We first make the simplifying assumption that the beam is quasi-monochromatic, which means that the functions 
Ej{uj) are assumed to be non- vanishing only in a narrow frequency band oj S [wq ~ Aa;/2, ujq + Aa;/2], with Aw ^ wq. 
Physically this means that the beam is a wave-packet of spectral width Aa;, centered roughly on frequency ojq. This 
implies that the functions aj(t), 4>j{t) vary slowly in comparison to cos(a;oO- To see this, first note that we can always 
choose to write the analytic signals in the form 

£;^.(t) =a^.(t)e*['^^ . (7) 

Then it follows from Eqn. Q that 

2 r°° 

a,(Z)e^'^'(*) = ^= / ^,(a;)e-*("-"»)*dw 
V2^ Jo 

pOO 

E,{uj' +ujo)e-"^''duj' . (8) 
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V2^ J- 

Then since Ei{uj' + ujq) vanishes by assumption for |a;'| > Aoj/2, the left hand side is a superposition of Fourier modes 
of low frequency \uj'\ < Atj/2 ^ cjo- 
Then the time average is defined by 



Ef\t))^^r E^^\t)dt, (9) 
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where T is chosen such that Aw <C ^ ^ wq- The quantities measured by the detector will be some combination of 
the following time averaged real quantities (expanding using Eqn. l|2Jl): 

{Ef\t)Ef\t)) ^ \ [{E,{t)E,{t)) + {E,{t)E*{t)) + {EUt)E,{t)) + {E*{t)E*{t))] . (10) 

With the assumption of quasi-monochromaticity we may now ignore time averages which contain the rapidly varying 
phase factor e*'^°* and retain only those over the slowly varying functions aj{t), e*'^^*^*-'. Thus, for example 

{E,it)E,it)) - (a2(^)g2.0.(t)g-2.^o*^ 

= 

(E^Amit)) = {alit)) 

{EAt)Ey{t)) - 

{EAt)E;{t)) = (a,(t)a,(t)e^(*^W-<*«(*))) . (11) 

The non- vanishing elements are all of the form Jy — {Ei{t)E*{t)). We denote the Hermitian matrix of quantities Jij 

the coherency matrix: 



(alit)) {a4t)ay{t)e^(^^(^)-^yW^) 
(a,(t)a,(i)e^('^«(*)-^-(*») (a^(t)) 



(12) 



This matrix was introduced by 47]. Now we relate the elements of the coherency matrix to measurements with 
a polarimeter. With an optical element known as a compensator, a coherent phase delay between the x and y 
components of the beam can be introduced. After passing through this device, the resulting analytic signal has the 
form 

Ej{t) = aj(t)e^l'^^(*)-<^^-'^»*l , (13) 

where the phase difference i5 = e^; — is a known constant. Taking time averages of products of these quantities 
yields 

{E,{t)E*{t)) = e-*(^'-'^'(a,(i)aj(i)e'^^'^*^"'^^^*") 

= e-*(^'-'^)jy . (14) 

The polarization is measured by passing the beam through a further optical element, a polarizing plate oriented at 
angle 9 to the x-direction, and measuring the total intensity of the transmitted light, I{d). The transmitted electric 
field is 

(6*, t) = Ei"-^ (t) cos e + 4''^ it) sine . (15) 

The measured intensity is thus 



I{9) = 2{E^'-\e,tf 

= J:,;, cos^ 9 + Jyy sin^ 9 + sm{29) [j-^yC^'^ + Jy^e'^] . (16) 



The Stokes parameters are then identified as 



/ = Jxx + Jyy = (alit)) + (alit)) , 

Q - J.. - Jyy = {alit)) - (alit)) , 

U ^ J^y + J*y = 2{a^{t)ayit) cos{(l)^{t) - (l)yit))) , 

V ^ -i [J^y - J*y] = 2{aAt)ayit)sini^At) - ^yit))) . (17) 



cy 

The measured intensity in terms of the Stokes parameters is 



I{9) = cos^ 9{I + Q) + sin^ 9il - Q) + ^ sin(26l) [{U + iV)e-'^ + {U - iV)e+'^] . (18) 
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The Stokes parameters can thus be determined by choosing various combinations of 6 and 9 and measuring I {9). 
(Note that the assumption of quasi-monochromaticity is actuaUy not necessary to define the Stokes parameters, e.g. 
see jivjV 

A few properties of the Stokes parameters and the associated coherency matrix J are worth noting. The Stokes Q 
parameter measures the amount of linear polarization in the beam in the x or y-directions. U measures the linear 
polarization in the directions at an angle 7r/4 to the x axis in the x-y plane. V measures the amount of circular 
polarization. If the wave is perfectly monochromatic, the amplitudes and phases of the electric field components do 
not vary in time. Then we may remove the time average brackets in Ean.ljlTII and there is the following relation 
between the Stokes parameters: 

= 0^ + + . (19) 
For the general case, this constraint becomes an inequality instead: 

> Q2 + JJ^ + y2 . (20) 
The matrix J is obviously Hermitian, J* = Jji. The determinant of J is: 

det[J] = \[I^- (Q' + U^ + V^)] > . (21) 
The polarization magnitude H (or degree of polarization) is a dimensionless quantity defined by 

EE + + = 1 - 4 det[J]//^ ■ (22) 

Note that most authors use the dimensionless polarization magnitude 11 as defined here, but some prefer to use 
dimensions of specific intensity (by multiplication by the total intensity) or brightness temperature. A beam with 
n = is said to be unpolarized. A beam with 11 = 1 is said to be a pure state (this terminology stems from the analogy 
^ between coherency matrices and density matrices in quantum mechanics) . By Eqn. H19() , a perfectly monochromatic 
beam (as opposed to a quasi-monochromatic beam) is a pure state. 

If several quasi-monochromatic beams all with the same mean frequency are superimposed, and the electric fields 
of each beam have phases which are varying completely independently of the phases of the other beams, then the 
coherency matrix of the total beam is simply the sum of the coherency matrices of the separate beams. An elementary 
proof may be found in — the gist is that in the forming the time average of the quadratic products of the sum 
of the electric fields, the cross terms between separate beams vanish (by the assumption of the independence of the 
phases). Beams with electric fields with no permanent phase relations are said to be incoherent. We will always 
assume, in summing two beams with the same direction and frequency, that the beams are incoherent and thus that 
the coherency matrices may be summed. 

A general polychromatic beam can be constructed by superimposing an arbitrary number of quasi-monochromatic 
beams. The coherency matrix elements and Stokes parameters may then be considered to be functions of frequency 
(spectral Stokes parameters). It is worth mentioning here that there is an alternative approach which yields spectral 
Stokes parameters without going via the route of the assumption of quasi-monochromaticity, developed by |2| . This 
uses some of the machinery of the theory of stochastic properties (see for example 34] ) . We will give a brief description. 

Consider again the example of a beam propagating in the z-direction with (real) electric field components e[^^ (t),i, j G 
{x,y}. First the auto- correlation {Rii{T)) and cross-correlation {Rijir)), i j) functions are defined: 

i?,,(r) ^ (i?W(t)i?f )(t + r)) ^,J G {x,y} . (23) 

Here the electric field components are assumed to be stationary stochastic processes, and the correlation functions 
are accordingly functions only of the time difference r. The angle brackets denote the expectation value. With the 
assumption of ergodicity (essentially that the expectation value is equivalent to a time average over a sufficiently long 
time interval), the expectation value is given by Eqn. ®. We now define the auto-spectral density functions Sii{uj) 
by a Fourier transform: 

Su{lu) = - I RuiT)e-'^^dT . (24) 



1 See for example H and I2I. 
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It follows from the assumption of ergodicity that these give the power in a given Fourier mode of the corresponding 
electric field component: 

Suiu;) ^ lun -^\E,{cj)W (25) 

where Ei{Lo) are the Fourier transforms defined in Eqn. Thus Eqn. I|24f) is just the Wiener-Khintchine theorem. 
The Sii are real. Similarly the cross-spectral density functions Sij{uj), i ^ j are defined by 

S,,{lo) = - j R,,{T)e-"^-dT (t^j). (26) 

I" J-oo 

The Sij{i j) are complex in general, and satisfy Sxy{i^) — S*^{lo). In Barakat's scheme, the functions Sij{uj) are 
related to the spectral Stokes parameters in a completely analagous fashion to the coherency matrix elements, except 
there is no need to invoke quasi-monochromaticity. This is perhaps a more satisfactory approach than the original 
treatment of Wolf. 

The polarization state and intensity of the beam associated with each frequency may also be considered to be a 
function of time. One can imagine decomposing the beam into a time series and Fourier analyzing successive segments 
of the time series to obtain the time dependence of each Fourier mode (this is what is actually done in polarimetric 
measurements of the time dependence of spectral Stokes parameters, see e.g. |l6j.'). 

In the next section, we describe a generalization of these coherency matrix methods to photon beams propagating 
in arbitrary spatial directions, which is the basis of our radiative transfer formalism. 



II. A TENSOR GENERALIZATION OF THE COHERENCY MATRIX 



The polarization state and intensity of a beam of light propagating in the 0-direction is characterized completely 
by the 2x2 Hermitian coherency matrix J^j, with (i, j) G {x, y}. There are several papers which study a description 
of polarized radiation transfer using the 2 X 2 coherency matrix 0, H S H IH Hi] . An obvious generalization is to 
allow (i,j) to become Cartesian tensor indices and to run over all of {x,y,z}. We obtain a 3 x 3 matrix: 

Q„- = {E,{t)E*{t)) , e {x,y,z} . (27) 

This matrix and its 4-dimensional generalization is one of the main tools in our formalism. It differs from the usual 
2x2 coherency matrix in that it is 3 x 3, the extra dimension corresponding to the direction of photon propagation 
n. Adding the extra dimension (and a fourth, when we introduce the covariant form in the next section) makes it 
much easier to handle the computation of the polarization of photons after general rotations, Lorentz boosts, and 
scattering. 

To our knowledge, only and Tl'l have systematically explored a similar approach previously. The matrix Qij is 
denoted the polarization matrix or polarization tensor (whether the 3-dimensional or 4-dimensional version is being 
talked about ought to be clear from the context). The polarization information is contained in the normalized version 
of Qij^ termed the normalized polarization tensor: 

cj, = . (28) 

Tr[Q] ^ ^ 

For a given photon direction n, the polarization vector is transverse, implying 

n'Q.j = . (29) 

It is useful to define a matrix with dimensions of specific intensity, also called the polarization tensor or matrix: 

= , (30) 

where the specific intensity / and the components of are associated with some mean frequency uj as discussed 
in the last section. The transition from the quasi-monochromatic case to the general polychromatic case may be 
taken as discussed in the previous section, and the components become functions of photon frequency. In general, the 
polarization matrix is a function of photon frequency (or momentum) and direction as well as space and time: 



lij = Iij{i^,n,x,t) 



(31) 
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Other conventions are also useful - in the computation of the Sunyaev-Zeldovich effects (SZE), it will be convenient 
to work with polarization matrices whose trace is either the occupation number n{i>, n, a;, t) or the phase space 
distribution function f{h',n,x,t) (associated with a particular photon momentum state and spatial position). Since 
the Stokes parameters are usually taken to have dimensions of specific intensity, we usually work with lij , but it is 
occasionally useful to use the other forms. 

Now in the usual description of polarized light, the Stokes parameters are defined with respect to a particular choice 
of "polarization basis". This is a pair of mutually orthogonal unit vectors e^^^e^^), both orthogonal to the beam 
direction. The Stokes parameters Q and U depend on the orientation of these vectors. By contrast the polarization 
matrix is a tensor and its components in any basis contain all the information about the polarization ellipse. Its 
advantage is that there is no need to rotate axes to define Stokes parameters. The Stokes parameters are given in 
terms of the polarization matrix and the polarization basis vectors as: 

l( I + Q U + iV\ (a) (b)j , ,x ^ r,^ 

2\V-iV I-Q ) " ' [a,b)e (1,2), (32) 

(the sum over the Cartesian indices ij is implied) which is just the previously defined coherency matrix J of equation 

It is of interest to see how the Stokes parameters transform if we choose a rotated set of basis vectors. In the case 
of a beam propagating in the z-direction for example, we have, choosing polarization basis vectors e*-^-* — x, e*^^^ — y, 



. I + Q U + iV Q 
-iV I-Q 




I,,^-\U-iVI-QO\. (33) 



If the basis vectors are rotated clockwise (according to an observer looking in the direction of propagation) through 
an angle x, the new set of basis vectors is 

e^-* ~ cosxe'"'"-' + sinxe*-^' , 

e^-* = cosxe^^-* — sinxe*-"'"-' . (34) 
Forming the matrices e^ji^^e^ji j, with (a, 6) € {1,2}, the primed Stokes parameters according to Eqn. (1221) are: 

/' = / 

Q' ^ Q cos 2x + U sin 2x , 
U' = U cos 2x-Q sin 2x , 

V' = V . (35) 
These transformations are also obtained directly from Qij by forming the rotation matrix: 

cos X — sin X 

Rix) = I sinx cosx I . (36) 



1 



Then 




U' + iV 

iV I'-Q' ) ^R{x)QR^{x) ■ (37) 


These factors of cos 2%, sin 2% in the transformation law are well known and associated with the fact that the linear 
polarization is described by a "headless vector" which is invariant under a rotation through tt radians. 

Now, given a set of matrix elements lij, supposed to represent a beam propagating in the direction n, how do we 
go about deciding if this matrix can represent a physical beam? Clearly the matrix must be Hermitian and satisfy 
lijn^ = 0. This yields a matrix whose elements contain four independent real quantities. In addition, the elements 
must satisfy some analogue of the relation between the Stokes parameters Eqns. ((T^ or if^ . The required condition 
is apparent from Eqn. H21|l — the eigenvalues of the matrix / must be non-negative. 
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Another obvious question to ask is, how does one construct the matrix of an unpolarized beam propagating in a 
general direction n? The only quantities we have available to construct the matrix are the intensity /, the components 
of the direction vector n, and the Kronecker delta (5y . The matrix must therefore be of the form: 



lij (n) = ASij + Briin 



3 ■ 



Now the matrix of an unpolarized beam propagating in the z-dircction is obviously 




(38) 



(39) 



Comparing this with the form of Eqn. (|38|l for the special case iii — 5iz, we see that A = —B = 1/2. Thus the matrix 
of an unpolarized beam in a general direction n is 



niJiji 



where we have defined the projection matrix P which will figure prominently later. 

The polarization magnitude (squared) of the beam described by a general matrix is given by 

2Tr[/2l 



Tr[/] = 



1 



(40) 



(41) 



This is readily checked with the matrix (|33|l of a beam propagating in the z direction. To see that this relation is true 
for any beam, we need only note that the matrix of a beam propagating in a general direction is related to l|33|l by a 
similarity transformation with an orthogonal rotation matrix, which does not change the traces in Eqn. I|41|l . Note 
also that reality of the right hand side of Eqn. H41|l follows automatically from the Hermiticity of I (since / and 
are Hermitian, and the trace of a Hermitian matrix is real). 

It is useful to write the matrix P{n) as an expansion in spherical harmonics. First we expand the matrix in 
powers of the Cartesian components of n, and then identify the result with expressions for the spherical harmonics 
in terms of the same components. Any spherical harmonic can be expanded in terms of the complex quantities 
(zi, 22,2:3) = (sin0e"^,sin0e~"^,cos6'). In terms of these functions we may write n = {{zi~\-Z2)/2^i{z2 — zi)/2,z^). The 



I = 2 spherical harmonics may be written as products of pairs of z's as follows (note 22 

^2,0 = 

^2,2 = 



^1^2 — 1 ^ ^3) 




The components of P{ri) are: 



P{n) 



1 - nj. -nxTiy -nxUz 

-UyTlx l-n^y -Uyfl^ 



Thus in terms of the spherical harmonics, the projection matrix becomes 

P(n) = ^/ + Re [y2,o(«)] ^0 + ^ (Re [Y2,M] A„ + Im [y2,m(n)] B^) , 

m=l,2 

where I is the identity matrix, and 

Ao = 



(42) 



(43) 



(44) 




B. 






1 
1 

1 o' 

1 




(45) 
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Thus 

Pin)=J2AimYirn{n) , (46) 

Im 

where 

. 1671- A A 

— J ^20 — ^0 , 

A22 = \iA2- 1B2) , A2^-2 = Al^ . (47) 

This way of writing the projection matrix comes in handy when performing the angular integrals in the transfer 
equation, and for numerical computation. 

In the computation of the Sunyaev-Zeldovich effect in the single scattering limit, derived in detail in Paper II, we 
have a situation where the scattered beam consists of an unpolarized component plus a small polarized perturbation 
proportional to the optical depth to scattering, r. It is useful to compute at this point an expression for the polarization 
matrix of the total beam to first order in the intensity of the perturbation. From Eqn. (|40|l . the beam has polarization 
matrix 

/.,(n)=/0 +rA/,,(n) , where 4 = |i^,,(n) , (48) 

or in matrix notation, / = /" + tA/, and J" = (/o/2)P(n). Substituting this into Eqn. H41|l we find, in matrix 
notation 

n ^ 2TTr [2/° A/ - /o A/l + t2 (2Tr[A/2l _ TrfA/l^) ^ ^ 

Now the unpolarized part of the beam is just a projection matrix multiplied by a scalar, so it has the property: 

Tr[7°A7] = ^ Tr[A/] . (50) 

Therefore the first trace in the numerator in Eqn. H49|l vanishes. The second term in the denominator can be ignored 
in the limit of a small perturbation intensity, and the squared polarization magnitude reduces to 



Tr[A/] 
Tr[/o] 



2Tr[A72] 
Tr[A/]2 



(51) 



In other words, the polarization magnitude of the total beam is just that of the polarized perturbation multiplied by 
the ratio of the intensity of the polarized part relative to the unpolarized part: 

n(J° + rA/) « r f ^i^) n(AJ) . (52) 



Tr[JO] 

Finally in this section, we note that the polarization matrices of incoherent beams associated with the same direction 
and frequency may simply be summed, by an obvious extension of the proof for coherency matrices mentioned in J] 



III. EXTENSION TO A COVARIANT POLARIZATION TENSOR 



The discussion so far has been in terms of electric fields measured in a particular Lorentz frame. In treating 
problems involving scattering from a moving medium, it is necessary to Lorentz transform the fields between frames. 
This can be done explicitly by writing down the time dependent electric and magnetic fields of the waves, and using 
the transformation law of the fields. However it turns out to be much simpler to use an extension of the matrix 
approach we have described in which the beam is described by a second rank tensor on spacetime. In this approach 
the Lorentz transformations become simple tensor (or matrix) relations. Indeed a full development of the radiative 
transfer of polarized light on a curved spacetime is possible with this covariant formalism. In this section we work in a 
curved spacetime initially but eventually restrict to flat spacetime, which is adequate for our application to the SZE. 
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We use the Minkowski metric with the convention = diag{— 1, 1,1,1}. The coordinates of a point in spacetime will 
be denoted either abstractly as x, or as an upper index quantity x^ — (t, x, y, z). Latin indices will denote components 
in the orthonormal basis {cx, ey,ez}. 

A truly covariant description of the electromagnetic field requires introduction of the field strength tensor Fa/s, and 
indeed a covariant description of the polarization of light can be accomplished entirely in terms of the field strength 
tensor p^ . But we wish to maintain an explicit connection with the Stokes parameters which are defined as time 
averaged quadratic combinations of electric field amplitudes, as measured by an observer at rest in some Lorentz 
frame. Thus we must express the electric field amplitudes measured in the rest frame of a given observer in a Lorentz 
covariant manner. The rest frame of the observer along the light beam can be defined by specifying a differentiable 
time-like vector field v{x) giving the observer 4- velocity all along the light cone (with v ■ v = —1). 

To generalize the coherency matrix of the previous sections, we need to find a covariant way to describe the time 
averaged product of electric fields. This must be done by constructing the electromagnetic field strength tensor for a 
plane wave in the WKB (or shortwave) approximation of geometrical optics (see e.g. QI^H^). In this approximation 
we treat the antisymmetric electromagnetic field strength tensor f^j/ as a test field (meaning that we may ignore the 
influence of f^j/ on the gravitational field) and assume that there are no charges or currents in the region we are 
considering. The field tensor thus obeys the source free Maxwell equations: 

V^F^" = , ^aF^, + V^F,„ + V,^^„M = . (53) 

The geometrical optics approximation consists in assuming that the field strength tensor can be written as the product 
of a slowly varying complex amplitude and a relatively rapidly varying phase factor: 

F^, = 5Re{i?^,(x)exp[z(^(a;)/e]} (54) 

where e ~ A/i is a perturbation parameter with A being the wavelength and L the length-scale over which the 
amplitude F^^, changes (roughly the local radius of curvature of spacetime) . In the geometrical optics limit we expand 
the Maxwell equations in an asymptotic series in e, take the limit e ^ 0, and read off the lowest order terms. Then 
e is absorbed into (p{x)^ by replacing ip{x)/e with ip{x) and then dropping the tilde. The lowest order terms describe 
the evolution of electromagnetic waves which, on scales which are large compared to A but small compared to L, are 
plane and monochromatic to an excellent approximation. 

Substituting equation (|54ll into the Maxwell equations (|53|l and working to lowest order in e, we obtain 

K.F^" = , 

= . (55) 

where the wavevector is a one-form field normal to surfaces of constant phase, defined by: 

fc^(x) = Vpi^ . (56) 

It follows from this, and the fact that covariant derivatives commute when applied to a scalar field, that 

V^fc^ = V^fc^ . (57) 

Contracting the second equation in (|55|l with fc", and assuming that F^i, vanishes only on hyper-surfaces, we find 

fc^fc^ = . (58) 

Thus the wavevector k is null. If desired we may associate a photon 4-niomentum p — hk with the wavevector, and 
go over to a particle description. The frequency of the wave as measured by a local observer with worldline x^{t) and 
4- velocity — dx'^/dr is given hy uj — —k ■ u — d(p/ dr (taking e = 1). Eqns. (|57|) and (|58|l imply that the wavevector 
is parallel transported: 

Vfefc^ = fc"V„fc^ = . (59) 

The curves a;''(A) with dx'^/dX — k^ are called light rays (A is an affine parameter along the ray). Note Vfc = 
fc'^V^ — d/dX is the directional derivative along the ray. As a consequence of Eqn. l|5^. the system of rays is equivalent 
to a Hamiltonian flow for particles with Hamiltonian 



i?(x,A:) = -5^''(x)fc^ft, 



(60) 
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Hamilton's equation dk^/d\ = —dH/dx^ is equivalent to Eqn. (|59|l . which is the geodesic equation for photons, while 
dx^ /dX ~ dH/dk^ gives the advance of the wavefront along the ray. The Hamilton- Jacobi equation H{x^ V(/3) = is 
also known as the eikonal equation for the phase factor (f{x). 

Now we would like to express the components of F^^ — —F^^ in terms of the electric field. Writing a propagation 
equation for the electric field requires that we have a differentiable time- like vector field {v ■ v = —\) giving the 
4-velocity (hence rest frame) of observers all along the light cone. In other words, the electric field is defined with 
respect to a family of observers with 4-velocity v{x). In the local Lorentz frame at point x of the observer with 

4-velocity v[x), the electric field components are = F^^ and the magnetic field components are = \iii^^F^"^ 
where Latin indices range over the spatial components and the carets indicate an orthonormal basis, with eg = v. 

The transversality from equation (|55|l implies = ^ifm^'^™ where fc is a spatial unit vector along the wavevector. 
In a general basis, we promote the electric field to a 4-vector 

= v^F^, . (61) 

By antisymmetry of F^^, i?^ is orthogonal to the 4-velocity of the observer v^: 

v'^E^ = . (62) 

In the geometrical optics limit we may define the complex amplitude of the 4-vector electric field using the complex 
amplitude of the field strength tensor: 

E^ = v'F^, , v^'E^ = . (63) 

Thus 

E^ = ^e{Ef, exp[iip{x)/e]} . (64) 

Eqns. H55|) imply k^E^^ = k^Ef^ = 0, which correspond to the transversality of the electric field to the the wavevector. 
The electric field 4-vector may be factored as 

E,, = Ee^ , (65) 

where e*is a vector which satisfies g^'^e*^e^ ~ 1, called the electric polarization vector. In the rest frame of v, this 
reduces to a 4-vector with spatial parts equal to the usual polarization 3- vector. 

Contracting the second of Eqns. (|55ll with and substituting Eqn. (|63|l yields an expression for the field strength 
amplitude in terms of the electric field 4-vector amplitude: 

Ff,^ ^ k^\k^E^ ^ k^Ef,) where k = -k^v" . (66) 

Next we would like to know how the amplitudes Ffj_i, and Ef^ change along a ray. We proceed by computing the 
divergence of the second of the Maxwell equations H53|) . 

g^^y^ [V„F^, + V,,F,^ + V,F^^] = . (67) 

Note that swapping the order of the covariant derivatives in the second two terms kills each term by Maxwell's 
equations. Thus using the following identity for the commutator of covariant derivatives in terms of the Riemann 
tensor, 

(VpVi. Vi.Vp)S'q/3 = -SapR^at^iy ^ SacrR" fj^j^y , (68) 

we find a wave equation for F^i, with curvature terms: 

g"^Va.yfjFf,, ~ R^,c.F% + R.o.F'^^ + i^^.a/jF"^ = . (69) 

Substituting equation (|54|l and working to the two lowest orders in e, one finds the following equation for the evolution 
of the field strength amplitude (the Riemann tensor terms do not appear to this order) : 



VfcF^, = --dF^,, , e = V„fc" . (70) 



The amplitude of the electromagnetic field changes along rays due to curvature of the wavefronts. For example, 
diverging rays {0 > 0) lead to a decrease in the electromagnetic field strength as the wave propagates. 
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Substituting equation (|66|l into equation (|70|l now gives an equation for the electric field evolution along a ray, 

1„\ ~ k 



VkE^^ l^Vkhik^-ejE^ + -^iVkv")E^ . (71) 

Factoring the electric field into its magnitude and direction (polarization) vector, = iSe^ where g^'^ e^^u = 1 find 
v^e^ = 0, we obtain 

Vfc^ - [Vklnk-U^ E , (72a) 

Vfce^ = ^(Vfci>")e„ . (72b) 

The first of these equations yields for example the 1 /r fall off of the electric field magnitude expected for a radiation 
field. The right-hand side of both equations vanishes for a plane wave in flat space, but not for a curved wavefront 
(e.g. a spherical wave), or for a wave propagating in a general curved space. 

It is perhaps surprising that the electric polarization vector is not parallel transported in a curved spacetime. This 
fact leads to a rotation of the polarization vector when a beam passes through a strong gravitational field. This effect 
has been noted before by several authors "27", "3^,® 113, and is important in considering for example the propagation 
of polarized radiation in the vicinity of a black hole. However it is true that if one defines the polarization vector to be 
parallel to the vector potential rather than the electric field of the electromagnetic wave, then it is parallel transported 
in the geometrical optics limit (see e.g. [sj, [13 )• This turns out to be consistent with the electric field evolution due 
to the enforcement of the gauge choice of the vector potential all along the photon worldline. Thus in considering the 
propagation of polarized photons on a curved spacetime it is more convenient to use a polarization tensor constructed 
from the vector potential to evolve the polarization state along the ray, and then make the transformation to electric 
fields. If the photon path does not pass through regions with an exceptionally strong gravitational field however, the 
rotation of the polarization vector resulting from this gravitational effect is small (but note that, strictly speaking, the 
rotation arises from the acceleration of the local observers, V^w" = dv^/dX, which can be large even in flat spacetime 
if a peculiar vector field of observer 4- velocities is chosen). In considering the propagation of photons through a 
cluster of galaxies for example, the effect is entirely negligible, and so henceforth we will restrict the discussion to 
flat spacetime, and work with the more physical polarization tensors we defined in terms of the electric fields. In flat 
spacetime we may drop the right hand sides in Eqns. H72|l . 

Having described the propagation of electromagnetic waves in the geometrical optics approximation, and defined the 
electric field in a covariant manner, we are equipped to construct the covariant version of the coherency matrix. We 
consider a plane electromagnetic wave propagating in flat spacetime, in the geometrical optics limit, with wavevector 
k and associated photon momentum p. Henceforth we will write the complex amplitude of the 4-vector electric field 
of the wave as , dropping the tilde for brevity. The 4-vector E has the property that its spatial components E'^ 
in the rest frame of the observer, in which — (1, 0, 0, 0), are equal to the measured electric field, and also E'^ — 
in this frame. Thus by analogy with the 3x3 polarization matrix Qij defined in Eqn. 127|l . we are lead to define a 
complex valued rank (0, 2) tensor called the polarization tensor: 

Q^.{x,p,v) = {E^ED . (73) 

The spatial components of this tensor in the rest frame of the observer are entirely equivalent to the elements of 
the 3x3 coherency matrix considered in the previous section. It is related to the stress-energy tensor T^'' = 
pn^piya _ ^gt^'^ F°'^ Fa0 (in Heavisidc-Lorentz units). In particular, the time-average energy density in the geometrical 
optics limit is Vf^v^, (T^'') — Q = Q^^ where angle brackets denote averaging over a few periods. Note that Q^'^Vf^ — 
Q^'^Pn = (where p^ is the four momentum of the photon). 

To define Stokes parameters, we need to specify a set of polarization basis vectors. The natural choice is the 
orthonormal tetrad basis vectors {ea}: 

6*0 = W , e*3 = V- V , 6*1 , £2 (74) 

where p = —v -p. These vectors have the property e*a • q, = ijab- Latin indices {a, 6, . . .} are tetrad indices; Greek indices 
i^, . . .} are coordinate indices. The spatial direction of the photon momentum for an observer with 4-velocity eo is 
£3. The remaining basis vectors, ei and 62, give the physical polarization space. We call this tetrad the polarization 
tetrad. There are associated basis 1-forms {e°}), which are dual to the basis vectors: (e°,efc) = (5"^. (Note that the 
polarization tetrad depends on the photon momentum, i.e. = ea{x,p). Thus, in general a different basis is needed 
for each photon momentum) . The coherency matrix of Eqn. H12|l is then given by 

Jab = e^h QA^,P, v) , (a, fe) e (1, 2) . (75) 
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In the case of beam which has a definite polarization vector e* (lying in the polarization subspace spanned by {ei, 6*2}) 
which does not vary with time, i.e. a pure beam, the polarization tensor is given by 

Q^tu{x,p,v) = Q e^€u . (76) 

So far we have used the tensor to describe a polarized EM wave. However if we wish to consider energy transfer 
between photons and some scattering medium, free electrons for example, we must consider the trajectories of photons 
in phase space. To describe an ensemble of polarized photons we must define a distribution function on phase space. 
The matrix Q'"' is not very useful because it describes a single classical system (the classical counterpart of a pure 
state) with specified wavevector fc^(x). Developing a kinetic theory requires an ensemble of systems encompassing a 
continuous distribution of wave-vectors at each spacetime point. We accomplish this heuristically by analogy with 
the usual treatment of the unpolarized case. 

In general, the stress-energy tensor of a system of photons may be written (in flat space) as 

T^'ix) = I ^P>V(^,P) , (77) 
J P 

where p = and f{x,p) is the (unpolarized) photon distribution function, which determines the total number of 
photons in the quantum state corresponding to phase space element (Px (fip according to 

dN = f{x,p) (fxSp . (78) 

The occupation number is given hy n{x,p) = f{x,p). It is not hard to prove that n and / are Lorentz scalars (see 
for example [s^l for a proof). 

To incorporate polarization we define the distribution function polarization tensor ff^^(x,p,v) in a manner similar 
to the scalar distribution function f{x,p). The polarization tensor of a general superposition of waves at a given 
spacetime point, according to the local observer with 4-velocity v, may be defined as Q^v{x,v) = (E^E*). Then by 
analogy with (|77|l we have 

QM- ^ p2 JM. (^^ ^) . (79) 

J P 

This obviously does not uniquely define ff^^{x,p, v). A rigorous definition requires a more sophisticated discussion (as 
in |3]). However we do not run into any difficulties if we simply regard ff^u{x,p,v) as a tensor generalization of the 
scalar distribution function which satisfies Eqn. (|79|) . f"^ has the property that /"''e^e^ d^p d^x is proportional to 
the number of photons in the phase space element d^p d^x passing per unit time through a polarizer oriented to select 
polarization (where this 4- vector must lie in the polarization subspace spanned by the vectors ei, e*2 of Eqn. 174(1 ). 
Contraction with the metric yields the scalar distribution function: 

9f.uF''{P,v)^fiP) ■ (80) 
The distribution function tensor also has the properties 

The generalization of Eqn. 1)41(1 for the polarization magnitude is 

U^^p^^irimi^.,, (82) 

which is manifestly a Lorentz scalar. 

Similarly to the case for coherency matrices and 3x3 polarization matrices, the polarization tensors (p, w) 
and (p, v) of two incoherent beams associated with the same photon momentum may be summed to yield the 
total polarization tensor. We shall always make the assumption that two separate beams are incoherent and have 
polarization tensors that may be superimposed in this manner. 

It is useful to define a covariant polarization tensor with dimensions of specific intensity (whose components are 
combinations of Stokes parameters). In the unpolarized case, the specific intensity I{x,p) is introduced by defining 



T'>" = / d-'ppf{x,p) = / dvdVlI{x,p) , (83) 
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where is the sohd angle element about the photon direction. It follows from (|77|l that 

I = hp^ f = h^v^ f = hv^n . (84) 

We define a specific intensity (or brightness) tensor by analogy with the unpolarized case: 

I^^-{x,p,v) = hpT{x,p,v) . (85) 

The intensity polarization matrix is zero outside from the two-dimensional polarization space spanned by e*!, where 
it may be written in terms of the usual Stokes parameters /, Q, U, and V: 

f - ^ f ^ + ^ U + tV\ , . 



2hp^\U~iV I-Q 

The Stokes parameters are functions of frequency (photon energy) ; / = is the spectral intensity. In an arbitrary basis 
the intensity is / = [hv'^ /(?)g^^ f^^. The normalization factor is chosen so that /n is the photon occupation number 
(phase space density divided by h^) for photons passed by a linear polarizer oriented along ei (and similarly for other 
directions). In terms of the total spectral intensity, we may write the polarization magnitude as 11^ = 2(7^^)/^''/^^ — 1. 

Now we wish to obtain an equation for the evolution of /'"^ in time. In the absence of scattering, photons follows 
geodesies (free stream) and the distribution function evolves according to the Liouville equation. The Liouville 
equation for the unpolarized distribution function is simply Df/dX — 0, where A is an affine parameter along the ray: 

dx^^dj_ dfp^df_ ^ 

dX dX dp^, ■ ^ ' 

This may also be written in the more familiar form (valid in curved spacetime) 



Vp/-rV,P>''0 = where Vp = p"V„ , 



provided one regards / as a function of the 3-momcntum in some frame, / = f{p) (not / = f{p)), by enforcing the 
mass shell constraint p^Pf^ = 0. 

Now we consider the generalization to the polarized case. From Eqns. ()72|l it follows that the evolution equation 
for Qf^^ in the geometrical optics approximation in flat spacetime is 

VpQ^, = . (89) 

This is suggestive that the Liouville equation for /^^ in flat spacetime is simply 

= . (90) 

This obviously reduces to the correct evolution of the unpolarized distribution function on taking the trace. This is 
in fact the correct Liouville equation in the polarized case. The proof is easy and goes along the following lines, [give 
proof] 

Generalizing this to a curved spacetime is more difficult. The evolution equations for the 4- vector electric field 
produce unusual terms. But it can be shown that using a polarization matrix based on the vector potential, the 
Liouville equation is simply given by Eqn. (|87|l with tensor indices added. We will not derive this result here, but 
refer to the discussions in jiidjiij S,, .9, ,17]- 

So in flat spacetime, the Boltzmann (or kinetic) equation for the distribution function polarization tensor is 

P"<9„/^, = C^, , (91) 

where the effect of scattering is contained in the scattering term C^v The form of the scattering term in the case of 
Compton scattering is derived in the limit of negligible electron recoil in ijVIl and in the general case in Will 



IV. LORENTZ TRANSFORMATION PROPERTIES OF THE POLARIZATION TENSOR 



On performing a Lorentz transformation between inertial frames, it is well known that the propagation direction 
of an EM wave (or equivalently, photon) is aberrated and its frequency (or momentum) Doppler boosted. The 
transformation of the polarization state of the beam is less well known. Here we derive the transformation law of the 
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polarization tensor between frames. This leads to the transformation law for the Stokes parameters, which turns out 
to be very simple (in fact, they are invariant) provided a certain choice of polarization basis is made. 

First, we find the transformation of the 4- vector electric field E'^{v) under a change of the local observer vector 
field from v{x) to v'{x). The spatial components of E^{v) are the electric field (3- vector) components measured by 
the observer with four- velocity v (in her rest frame). Let us find the relationship between the electric fields E^^{v) 
and E^^{v'). To determine this, recall from (|66|l that the definition of i?^ implies the following relation between E^ 
and the field strength tensor for a plane wave: 



F^,^ = p~\p^,E,. - p^Ef,) , p = -p-v. (92) 



Therefore, since E^{v) — F^^Vy, 



E^iv') = ?-(g^^^ + tyjL\ E-{v) , (93) 
P \ P ' 



where p' = —v' ■ p. These relations suggest introduction of a tensor P^^(p, {j") which projects onto the physical 
polarization plane €1-62 by eliminating components in the surface spanned by eg f-^d £3 (or ^ ^^^id p): 

1 J) D 

= Qtiu ^ -{Pfj^Vu + PuV^j) ^ where p=-v^p^. (94) 

This satisfies the idempotency relation P'^^P"'^ — P'^'^^, so that P'^^, is a projection tensor, henceforth denoted 
the screen projection tensor which will prove to be important. The transformation law for measured electric fields, 
equation H93|l . may be written in terms of the screen projection tensor as follows: 

E,iv') = <^P'^ ip,v')EAv) , (95) 

since the second and the last term in P''^ vanish when contracted with E^ . In the geometrical optics limit, taking 
components in the appropriate Lorentz frame, Eqn. H95|l reproduces the usual relativistic transformation of electro- 
magnetic fields. The dependence on the four-momentum appears because the boosted electric field depends on the 
magnetic field, which in the geometrical optics limit is j5 x E. 
The transformation law of Q^'^{x, v) follows from Eqn. H73|l : 

Q""'"' (P, ^) = (^l) P'-'^iP, ^)P'''M ^) Q^^ip. • (96) 

Since the integration measure in Eqn. H79|l is Lorentz invariant, the transformation of Q^^ implies that f^^ transforms 
in the following way under change of the local observer 4-velocity: 

r''^' {p, v') = p^;(p, ^)p%{p, ^)r{p. v) . (97) 

Note that the following transformation property of the specific intensity tensor follows immediately from the trans- 
formation properties of the distribution function and Eqn. (|85ll : 

{p, ^) = ( t|) P^'M ^)P%iP, v')I^^''{p, v) . (98) 

In the unpolarized limit, the trace of this reduces to the familiar statement that I /v^ is Lorentz invariant. In the 
general polarized case, one sees that all of I/v^, Q/v^, U/v^, V/v^ are invariant under a boost along the photon 
direction. 

The transformation properties of the Stokes parameters under a boost in a general direction are clearly dependent 
on the polarization basis chosen in each frame. To work out the general case, we consider the transformation from 
frame K' (the rest frame) with 4-velocity We into frame K (the lab frame) with 4-velocity vi. In lab frame coordinates, 
let = 7(1, w). In the rest frame, the brightness tensor P' (p, Ve) contains all polarization and intensity data of a 
photon with 4-momentum p. We denote the photon momentum in rest frame coordinates, as p'^ = p'(l, ^i')j in 
lab frame coordinates as p^^ = p(l, n). 

The Stokes parameters measured in K' are defined by specifying a set of orthonormal polarization basis vectors 
{si, S2}, where si • p = 5*2 • p = 0, and si ■ Ve = S2 ■ Ve = 0. Since the vectors Sa are purely spatial in the rest frame, 
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we may write = (0, e^), a € {1,2} with e'^ • n' = 0, e[j • = 5ab- The Stokes parameters measured in K' are 
determined by the quantities: 

J'ab = I'''''\p.v,)sat,.Sbu- , {a,b) e {1,2} . (99) 

To determine the Stokes parameters measured in if , we must specify lab frame basis vectors {ti,t2} which satisfy 
ti ■ p = t2 ■ p — 0, and ti ■ vi = t2 ■ vi = 0. We write = (0, Cq), a G {1, 2} with Ca ■ n ~ 0. The analogous quantities 
to those in Eqn. in K are 

Jab = I'"'{p,vi)ta^tb. , {a,b) e {1,2} . (100) 

The vectors {ta} are not uniquely determined, but there is a natural choice of basis which keeps the transformation 
of the Stokes parameters simple. Applying the transformation law (|98|1 to Eqn. (jlUUfl . and replacing 

rf'ip, V,) ^ p%(p, v,)p^g{p, v,)r\p, ve) , (101) 

we obtain 

Jab^(^^}j P''\siP^^e,Vl)P\p,Ve)ta>.tb, . (102) 

where 

P'^^piP, Ve, Vl) ^ P^ip, Vl)P^M ^^)P",iP^ ^e)P^siP^ ^e) , (103) 

and 

p' = -Ve-p, p=-Vi-p. (104) 

Comparing Eqn. p02|l to Eqn. H99|) . it is apparent that if we demand that the vectors ta satisfy: 

P%(p, Ve)P\{p. m) ta^ = Sa-y , (105) 

then the transformation law of the quantities Jab reduces to 

Jab - J'ab , (106) 

and thus the Stokes parameter Q, for example, transforms simply as 

Q-{j)\'. (107) 

and similarly for the other Stokes parameters. Since ta^ is assumed to be purely spatial in K {ta • "5; = 0), it follows 
that P^^{p,vi) ta^ = taa, and the transformation H105() simplifies to 

P''^ip,Ve)tao. = Say. (108) 

In 4-vector notation, using p ■ ta = we find 

Sa ^ ta + ^{Ve ■ ta)p (109) 
P 

This manifestly satisfies • = 0. Since ta must be purely spatial in K we have 

ta - Ve= { — ] Sa- Vl , (110) 



which yields 



P 



ta ^ Sa- -{Vl ■ ta)p . (Ill) 
P 
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The transformation law of the polarization basis 3- vectors Ca, now follows. Denoting Sq in lab frame coordinates 
as = (sJJ, Sq), and Lorentz transforming into K we obtain 



< + (7-l)^. (112) 



Thus the spatial part of Eqn. Ijlllf) yields 



ea = + (7 - 1)^^^^ - jn{v ■ <) . (113) 

V 

This transformation law was previously obtained by p^ . One may check, using the transformation law of n (see 
Eqn. (|196(l ^. that the polarization basis 3- vectors Ca are indeed orthonormal and orthogonal to n. The fact that such 
a complicated transformation of basis is needed to ensure that the Stokes parameters transform in a simple fashion 
demonstrates that the tensor approach is more convenient when dealing with relativistic transformations of polarized 
beams. 

The screen projection tensor P'^^{p, v) defined in Eqn. (|94|l is an important tool in this polarization tensor formalism. 
It projects onto the "screen" subspace orthogonal to the photon momentum p and local observer 4-velocity in the 
sense that it leaves f^'^{p,v) invariant: 

P'^ciP, v)r^{p, v) - F^ip, v) . (114) 

In a local Lorentz frame P^'^ is simply the 2x2 identity matrix in the subspace orthogonal to and p^^. It is 
appropriate now to discuss some of its properties, which will be useful to refer to in later sections. It may also be 
written in the form (used in ^.nd ^3 for example) 

P^iy = 5m<> + '"t^'^'v - nf,n^ , (115) 

where is a spacelike unit vector giving the propagation direction of the photon with respect to the observer: 

p — p {v + n) , n ■ V = , n-fi—l, n-p = p. (116) 

In the rest frame of the observer with 4-velocity v, the 00 and Oi components of Pfi,^{p, v) vanish, and the spatial 
components are given by 

Pij{p, v) = 6ij - n^rij . (117) 

where n is the photon direction 3-vector in the rest frame. By an obvious generalization of the argument leading to 
Eqn. H40|l . the distribution function tensor of an unpolarized beam is given by 

ff.M v) = \I[P)P^.M v) . (118) 

There are the simple properties: 

v^P^,{p,v) = , p^P^.(p,t/) = , 
g^.P^'^ (p, v) = 2, P^'' (p, v)P^, {p, v)^2, 
P\{p,v)P^,{p,v) = P\{p,v). (119) 

Contracting two projection tensors with the same photon momentum but different observer velocities yields, with 
Pi = -p- vi, p2 = -p- vh.: 

P\{P, vi)P''\p, V2) = + ^ + ^ + ^/p- . (120) 

P2 Pi PlP2 

Another contraction yields: 

P%{p, vi)P\{p, v^)p-'\p, V2) = P'^f'ip, Vl) , (121) 

which proves that a beam that is unpolarized according to some observer is also unpolarized according to any other 
observer. 
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Wc close this section with a demonstration of the Lorcntz transformation of the polarization state of a beam using 
polarization matrix manipulations. This will serve as an introduction to the more complicated matrix manipulations 
used in the derivation of the SZE later. 

We consider a photon with a general polarization state propagating in the 2;-direction with momentum p. We will 
compute the polarization matrix of the beam measured by an observer moving in the x-direction with velocity v. 
We work in the inertial frame K (miprimed) with basis 4- vectors e^, and observer 4- velocity vo = et with unprimed 
components = (1, 0, 0, 0). Let us consider a partially polarized photon beam propagating in the direction, with 
4-momentum p with unprimed components p'^ = p{l, 0, 0, 1), and distribution function polarization tensor f^^{vo) as 
measured by observer vq. We suppress the photon 4- momentum argument of the polarization tensor since we deal 
here with a monochromatic beam. 

We may perform tensor manipulations by defining 4x4 matrices with entries equal to tensor components, with 
no distinction between raised and lowered indices, provided there is a separate matrix for each combination of raised 
and lowered indices. Thus we define f'^'^{vo) = [f{vo)]ni, where 

















a 


b 








b* 


d 





\o 












(122) 



where the row elements from left to right and the column elements from up to down refer to the {t, x, y, z) components. 
Choosing polarization basis vectors ei = x, 62 = y, the coefficients (a, b, c, d) are related to the usual specific intensity 
Stokes parameters (here h = c=l): 

I + Q 



a = 



b* = 



2p3 ' 
U -iV 



b = 
d = 



U + iV 
2p3 

I-Q 
2p3 • 



(123) 



Now wc consider the polarization tensor measured by an observer moving perpendicular to the photon momentum 
in the unprimed frame, with 4- velocity v. The rest frame of this observer is K' . We take 4- velocity v to have unprimed 
velocity components v'^ = 7(1,1), 0,0), where 7 = l/Vl — v"^. Then the polarization tensor measured by the observer 
with 4- velocity v has components in the unprimed frame as follows: 



where the projection tensor P'^^ is given by: 



and p' 



-p ■ V 



P 

■ jp. In matrix form P'^^ = [-Pl]^j^, where 

"2 V v'^\ 



P^Pu 



P' 



/2 



(124) 



(125) 



Pi = 



— V 1 Q V 
10 



(126) 



The lowered index quantity P/^i, is represented by a matrix P2 with different entries: 







— V 







— V 


1 





V 










1 





v 




V 





J 



The idempotency relation satisfied by the projection tensor, P'^ aPtiP ~ Pa/S, implies the matrix relation 

P[P2 = P2 , 

which is satisfied by the matrices above. Using the projection matrices wc find f^^iv) = [f{v)]^^ where 

/ av^ av bv av^ \ 

f{v) = P,f{vo)P[-- 



av a 

b*v b* 



b av 

d b*v 



av^ av bv av^ 



(127) 



(128) 



(129) 
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Now we would like to obtain the Stokes parameters measured by the observer with 4-velocity v. This is given by 
the Lorentz transformed matrix /' — AIA^ , where in this case the matrix A is a boost matrix in the x-direction: 



/ 7 ~jv ' 

-7U 7 

10 

V 1 



(130) 



Thus the polarization matrix of the beam measured by the observer at rest in K' , in primed coordinates, is 

f'iv) 



/O 

a/7^ 6/7 av/j 
&*/7 d b*v 



(131) 



V aw/7 bv 



av 



1 



Adding the diagonal elements of this matrix yields a + d, since the total photon occupation number / ^'^rj^iv — I /p^ — 
a + d is a Lorentz invariant quantity. The ti and it elements are zero since the electric field 4- vectors in this frame 
are purely spatial by definition of the tensor f^'^{v). A matrix of this form is termed a physical polarization matrix^ 
since its elements correspond to quantities measured by a polarimeter in this frame. 

We now examine the Stokes parameters measured by an observer with 4-velocity v according to the polarization 
tensor derived, in order to check that our formalism agrees with the known transformation properties of the Stokes 
parameters. To obtain the Stokes parameters in the boosted frame, we need to define a photon polarization basis. 
The basis given by Eqn. H113|) should guarantee that the Stokes (divided by the cube of the momentum) are invariant 
under the boost. In the unprimed frame, the basis vectors were ei — ^2 ~ U- In the lab frame (here primed), the 
photon momentum is = p("f, —"fv, 0,1), the photon direction is n' = (—u, 0,1/7), and the lab frame velocity is 
v' = —vx. Substituting these into (|113|l yields 



'-1 



(0,1/7,0,1^) 



= (0,0,1,0) , (132) 

which are clearly orthonormal and orthogonal to the primed photon momentum, and reduce in the limit w ^ to the 
unprimed basis. A more general polarization basis is obtained by rotating these vectors through an angle x about the 
photon momentum, as follows: 



i cosxe'i -sinxca = (0,cosx/7,-sinx,t'cosx) , 

cos X £2 + sin X e'l = (0, sin x/ 1, cosx,v sin x) ■ (133) 



='2 ' ' 



In this rotated basis, the Stokes parameters in the boosted frame are given by the quantities 



a' 




/'4 


= a cos^ 


X 


-(54 


c) cos X sin X 


+ d sin^ X 7 


b' 


— ^1 


/'4 


= b cos^ 


X 


f (a - 


d) cos X sin x 


7 * • 2 

- sm X , 


d' 


- f'^ 

— ^2 


/'4 


= dcos^ 


X 


+ (& + 


c) cos X sin X 


■f a sin^ X ■ 



(134) 

Thus the measured Stokes parameters in the primed frame are given by: 

/' , / 

d' = a + d = — , 



^ a' — d' = [a — d) cos 2x — {b + b* ) sin 2% = cos 2x — sin 2% 



^ b' + b'* = (6 + 6*) cos 2x + (a - d) sin 2% = ^ cos 2% + sin 2% 



^' = i{b' -b'*)=i{b-b*) = ^ . (135) 



We find that with the choice x = 0; the Stokes parameters transform as claimed in Eqn. (|107|l . and with a general x 
the Stokes Q, U, V transform in the expected fashion under rotation of the polarization basis vectors. 

Fig. ^ illustrates the results of using this procedure to compute the polarization magnitude Il{fj,s,fs) of the 
radiation field produced by Thomson scattering of a polarized beam incident along the ^-direction (in lab) on a 
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FIG. 1: Polarization magnitude of the radiation field produced by Thomson scattering of a beam incident along the z-direction 
on an electron with velocity 0.7c also along the z-direction, for the following two cases of the polarization state of the incident 
beam: (a) Q = U = V — 0, (b) Q/I — 1/2, U = V — (where the Stokes parameters are defined in the basis ei = x, e2 = y). 




FIG. 2: Polarization magnitude of the radiation field produced by Thomson scattering of an unpolarized beam incident along 
the 2-direction scattering from an electron moving along the a;-direction with velocity (a) zero, (b) 0.4c, (c) 0.9c, (d) 0.99c. 

relativistic electron, as a function of the polar angles [fig = cosOs, (fs) of the scattered photon about the z-axis (see 
Eqn. H152|l '). The Thomson scattered radiation field in the electron rest frame is derived in the next section. The 
polarization matrix of the scattered beam in the electron rest frame was computed, as given in Eqn. (|148() . and then 
boosted to lab frame, where the polarization magnitude was computed. The polarization magnitude is a scalar and 
thus changes under the boost due entirely to the aberration of the photon direction. 
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FIG. 3: Thomson scattering of a pure incident beam from an electron at rest into a specified final polarization state. 



V. THOMSON SCATTERING 



In this section we present a derivation of the equation for the time evolution of the distribution function polarization 
tensor due to Thomson scattering from a distribution of stationary electrons, starting from the classical results for 
Thomson scattering, ignoring the effects of electron recoil and induced scattering. Note that throughout this and 
subsequent sections we work in flat spacetime. 

Recall that for a completely linearly polarized beam, Q^^'^e*^e^ is the time-average energy density for electromagnetic 
radiation of polarization , where is spacelike and normalized, €*-e = g'^'^e* = 1 . Consider a completely polarized 
beam with polarization vector Cj and momentum pi incident upon an electron at rest (Fig.|2J). The polarization matrix 
of the incident beam is Qiti ® e* where cQi is the incident flux (we choose units such that c = 1). Normalization of 
the polarization vector implies Qi — e*^eii/. In the Thomson limit, in which the electron recoil is negligible, the 
differential cross section for Thomson scattering of a beam into final momentum ps and polarization xse^ is |2^ 

Thus the power per unit solid angle in the scattered beam is 

_ = _Q,|e, ...I (137) 

where dfij is the element of sohd angle associated with the direction oipg. We may also write Qi\ej* ■is\^ = Qt^ (-1 ^i^sv 
Next consider a gas of electrons all at rest with number density Ue'. we work in the rest frame of the electrons 
throughout this section. Assuming incoherent scattering, multiplying Eqn. (|137|l by nedfts converts scattered power 
per electron to the rate of change of energy density in final polarization state eg ■ 

^^e^M^s" = ^^neQ'^'"el^es,,dns ■ (138) 

Note that the time derivative d/dt here should actually be interpreted as a total derivative taken along the ray, 
d/dt ^ d/dt + n ■ 'V , since eventually the left hand side of the evolution equation will be replaced with the left hand 
side of Eqn. H159|) . Using equation l|79|) . and setting pi — ps since we are working in the Thomson limit, we may 
convert this to the change in the phase space density matrix, giving 

—j^e*^^esudfls = -^Tieft^el^es^dn^dns . (139) 

We would like an equation for the change in /^^ due to scattering, but Eqn. H139() gives the change only for a 
particular (but arbitrary) polarization of the outgoing wave, e^. We cannot remove the polarization factors and 
conclude dfl^"^ cx ff^'^ because the polarization of the incoming wave does not lie in the same plane as the polarization 
of the scattered wave. For a given outgoing momentum ps, the outgoing polarization is a linear combination of the 
two basis vectors ei and 6*2 (associated with the photon of momentum ps) of HIIII Thus, fi^f*si^isu projects out of 
the incoming density matrix /'"^ only those components lying in the ei-e2 plane. This projection is equivalent to first 
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projecting /["^ with ei (g) ei + £2 ® £2- But this is exactly the projection tensor of Eqn. (|94(l . with p = ps being the 
outgoing photon momentum and v being the electron 4-velocity. Projecting the final polarization vector with P^^ 

does not change it: P^a^sfi = ^sa- It follows that /f £3^6^^ — f^^P'^a^'^ p^tii^si^- Now it is safe to remove the 
outgoing polarization vectors from Eqn. (|139|l . 

We conclude that, for any initial and final polarizations, 



dfr{Ps,Ve) 

dt 



V, )P''p{ps, V, ) j dn, ff{p,, ve) . (140) 



Eqn. H 140(1 is the key result for Thomson scattering in the polarization tensor formalism. It gives the photon scattering 
rate per unit volume for given momenta and polarizations. 

If this argument seems a little dry, we note that the appearance of these projection tensor follows straightforwardly 
from elementary classical electrodynamics. Consider scattering of a quasi-monochromatic beam with central frequency 
Lo incident in direction n*^*^ by a single electron at rest. Let the complex amplitude of the (analytic signal of the) 
incident electric field be £j('^(i), with E^"^"^ ■ n'^^^ = 0. The incident wavevector is fc = u:n'^^^ jc. The complex dipole 
moment of the radiating electron induced by the incoming wave is p = [e^ /muP')E^'^\ The analytic signal of the 
electric field of the dipole radiation in the far field produced by the oscillating electron (the scattered field) at position 
x' = x'ris is, using the formula for electric dipole radiation |2m |. 

^ !^ ^oj^[p-{ns-p)n,] . (141) 

47r X 

The matrix Qij matrix of the scattered beam is thus given by 



x'^d^l' Q,j{ns) ^ x'^dW {Er,^E;^ 



a 

1 1 \ I I / -- I'* J I '(■ 1 1 \ r I , ■ r< , 

—cTTdn' 

4tt 



- ..... 

where ar = (e^/mc'^)^/127reQ is the Thomson cross section (in SI units), and we have defined 



Pins){Q{n^'^))P{n,) , (142) 



g(n«) = {El'>{t)E]'>{t)) 



3 



[Pins)h, = S.,-[nsUns], . (143) 

The matrix P{ns) is the projection tensor we saw before, which can be thought of as selecting the components 
of the incoming field which are transverse to the wavevector of the scattered field. The extension to a arbitrary 
"polychromatic" incident beam with frequencies not restricted to a small waveband follows provided the electric field 
components in seperate wavebands are completely uncorrelated. 

If the integration time is sufficiently short that we can ignore multiple scatterings, we may replace neCT dt in 
Eqn. H140|) with the optical depth to Thomson scattering, r. Then we have 

/r(A, ^7e) = ^ P\{Ps,v, )P''p{Ps,v, ) j dn, ff{p^,v,) . (144) 

It follows that scattering of a photon with a given incident momentum pi leads to a scattered beam with normalized 
polarization tensor 

cP^^Ps, Ve) = ' ^ ' . (145) 

PjS{Ps,Ve)(l)] {pi,Ve) 

Taking the trace of the scattering rate Eqn. (|140|l yields the evolution equation for the scalar distribution functions 

fiipi) = 9af3 f"^ (Pi) , fs{pi) = 9apff^{Ps), which may be written in the following form (by definition of the differential 
scattering cross section): 

^{fs d^x d^ps) = (ne d^x) dUs (/, d^K) , (146) 
dt diL 



which yields the differential scattering cross section in the rest frame (and the Thomson limit): 

Pa0{Ps,Ve)4>i^iPi,Ve) ■ (147) 



da 3(Tt r~, / - ^ \ lap I 



dfls 8n 



23 




FIG. 4; Polarization magnitude versus rest frame scattering angles (in polar coordinates). The plots for incident beams with 
Stokes parameters: (a) Q = U = V ^ 0, (b) Q/I = 0.9, U = V = 0, (c) Q/I = 0.5, U = V = 0, (d) U/I = 0.5, Q = V = . 

In rest frame coordinates, we may deal with 3x3 matrices rather than tensors. Then we may write the normalized 
polarization matrix of the scattered beam in terms of that of the incident beam as: 



(l>sins) 



P(n,)0,(n,)PK) 



(148) 



The polarization magnitude of the beam reduces to a familiar form in the case of an unpolarizcd incident beam. For 
example, consider the case of an unpolarized beam incident in the z-direction in the rest frame, and let the rest frame 
direction vector of the scattered beam have components 



Us = (cos ips Vl - Ms: sin tps y/l - fl'j, fls 



(149) 



The incident normalized polarization matrix is (p{ni) = i-'(n,;)/2. The polarization magnitude of the scattered beam 
is given by 



n2(/,„<^,) = 2Tr[02]/Tr[0,]2-l 



1 + 



(150) 



which is independent of ips since the incident unpolarized beam picks out no preferred azimuth. In the case of an 
incident beam with a general polarization state, we may choose polarization basis 3-vectors ^ for the incident beam 
ei = Bx, £2 = By and write, in the 2x2 polarization subspace, 



: _ ]_{ I + Q U + iV 
2l\U-W I-Q 



(151) 



^ The unit 3-vectors pointing along the Cartesian coordinate axes x, y, z are denoted ex, ey, e^. 
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FIG. 5: Phase function $ versus rest frame scattering angles (in polar coordinates, for a beam incident along the z-axis with 
Stokes parameters: (a) Q = U = V = 0, (b) Q/I = 1, U = V = 0. 



Then we find the scattered polarization magnitude 



2 



U^{n„ips) = l + ^ J. (152) 

+ + (-1 + Ms^) (Q cos(2 (^,) + [/ sin(2 (p,)) //]' 

This may also be written as 



P 



1 - nsinsj(t>f 



This function is plotted for incident beams with various polarization states in Fig. ^ 

From Eqn. (|148|l we can determine the probability for a photon to Thomson scatter into a particular solid angle 
element dVls , which is conventionally termed the phase function. This is simply proportional to the differential cross 
section, which in matrix notation is 

= ^ Tr[0,(n,)P(n.)] . (154) 

Thus the phase function for Thomson scattering is a function of the scattered direction vector and the elements 
of the incident polarization matrix <f)i{ni) (the dependence on rij is implicit in (f)i{ni)). Denoting the phase function 
as $[ms, tpiini)], we use the normalization 

$[n„(^,(n,)] ^ = 1 . (155) 

Since J Pij{ns) dftg /4:Tr — 2Sij/3, we have 

$[n„ 0,(n,)] = ^ Tr[</.(nOP(n,)] . (156) 

For example, consider the case of an incident beam with ~ (0, 0, 1), and intensity polarization matrix with Stokes 
parameters defined with respect to polarization basis vectors (associated with the incident beam) ei — e^, €2 = eyi 



I + Q U 
U I-Q 




lin) - i I U I-Q \ . (157) 



and let the scattered direction have the components (|149|l . Then we obtain the phase function p^ : 

$[n„0,(nO] - ^[1 + m'- (1-Ai?) (Q cos 20, + [/ sin 20,) //] . (158) 

In Fig.[5lthis function is compared for unpolarized and completely polarized incident beams. The polarization of the 
incident beam destroys the azimuthal symmetry of the differential cross section and phase function. 

This completes our discussion of the generation of polarization by classical Thomson scattering in the electron rest 
frame. In the next section we use these results to construct the photon Boltzmann equation for Thomson scattering. 
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P = P 





(a) Gain 

FIG. 6; Gain and loss processes in kinetic/master equation for Thomson scattering. 

VI. KINETIC EQUATION IN THE THOMSON LIMIT 

The evolution of the polarization matrix of the radiation field due to Compton scattering is determined by the 
Boltzmann (or kinetic) equation 

P"9a^. = C^. , (159) 

where Cp,y is the Compton scattering term. In this section we derive this scattering term for arbitrarily relativistic 
electrons and polarized photons. In fact, since the CMB photons have negligible momentum in comparison to the 
electron rest mass, the SZE can be calculated accurately with a simpler scattering term derived in the Thomson limit, 
in which the electron recoil is ignored. However, we go through the complete relativistic calculation in any case since 
there are other applications in which the recoil effect cannot be ignored. 

We do however ignore the effect of induced (or stimulated) scattering, which is required for example to obtain the 
Kompaneets equation often used to derive the thermal SZ distortion. But the terms due to induced scattering in the 
Kompaneets equation are negligible in the case of cluster SZE, and in general in the unpolarized case it is known that 
induced scattering is a negligible effect unless electron energies are comparable to the electron rest mass. In any case 
a rigorous derivation of the induced effects require a quantum treatment [s^l , which we have not developed here. We 
also neglect electron polarization, assuming that frequent Coulomb collisions destroy any spin alignments, and Pauli 
blocking (which is irrelevant in the regimes of interest). 

We now use the preceding results to derive the Boltzmann collision term in the electron rest frame. This is derived 
by the following heuristic line of reasoning. If we ignore polarization and assign a scalar distribution function f{p) to 
each photon, the scattering rate is given by Eqn. (|146|) with the cross-section for the transition from pi to Ps replaced 
by its unpolarized form, which in the Thomson limit is 

^-'-^[l + in.-n.r] . (160) 
ails Ion 

We could then write the rate of change of phase space density by subtracting from Eqn. 1146|l the rate of scattering 
out of Ps ■ That result is known as the master equation or Boltzmann equation for / |y, f22,J : 



jj{p)^n^j dQs J d^p^^{ps]P^)f{p^) [5^ {Ps - p) - 5^ {p^ - p) 



(161) 



The meaning of the master equation is that the rate of change of the photon number in a given phase space element 
is given by summing over all scatterings into and out of this element. In this expression, Ps is not a free variable, it 
is a function of the incident photon momentum and scattering angles, Ps — Ps{pi, ns), determined by the scattering 
kinematics. In the Thomson limit, |ps| = so the scattered photon momentum is simply given by Ps{pi,ns) — 
\pi\ns. This allows completion of the integral over the first delta function. 

The first and second terms inside the square brackets correspond to scatterings into and out of the beam (with 
momentum p) respectively, and are termed the gain and loss terms. The delta functions select the appropriate states, 
as indicated in Fig. O Eqn. H161|) is simply a statement of photon number conservation combined with the rate of 
scattering into the final momentum state H146|l . 

Now we wish to generalize this to the polarized case. The polarization tensor allows us to extend equation H161|) 
to a general polarization state, and write down the kinetic equation for polarization corresponding to Eqn. (|140|) . 
Because the transition rate is linear in fi and fs it is possible to write the scattering rate for a linear superposition 
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of initial states to a linear superposition of final states. Assuming linear superposition for incoherent light, we can 
write the most general incident state as /"^{pijVe) and ask for the transition rate of each element of this matrix. The 
transition rate is a linear transformation from f"'^{pi,Ve) to f'^'^{ps,Ve) and must therefore take the following form, 



with some matrix that we call the polarization scattering tensor. It is convenient to write this as <l'[!^^(s; i), where 
the arguments (z) and (s) are abbreviations for the pairs of 4-vectors [pi^Ve) and {ps,Ve), 

The polarization scattering tensor is effectively a 4 x 4 matrix giving the transition rate between all possible initial 
and final polarization states. It follows from Eqn. (|140ll that in the Thomson limit the polarization scattering tensor 
is given by 

= ^-^P^iPs^V,)?" fAPs,V,) ■ (163) 

Now we make the following ansatz for polarized analogue of the master equation corresponding to Eqn. p61|l : 

^/'"-(p, V,) ^n,J d^s I d'p, (p„ Ve;p,, t/e) r'{p^. Ve) 

X [S^J'-p -p)- 9a,p r^iP, V,) S'ip^ - P)] . (164) 

This is the rest frame form of the scattering term C^" in Eqn. (|159|l . With this ansatz for the master equation, it 
may be checked that for any two initial and scattered pure states /"^{pi, Ve) — /iff (ef )* and f^'^{ps, Ve) = /se^(es)*, 
Eqn. H164|l reduces to Eqn. 1)161(1 with the Thomson cross section Eqn. 1)147(1 . Then since any polarized beam can be 
written as some superposition of pure states, it follows that Eqn. I(164() is true for all polarization states. This verifies 
that the ansatz ((164() is correct in the Thomson limit. 

The first term in the square brackets is exactly the gain term of Eqn. 1(1401) . The second term represents losses to 
any final polarization state; the sum over polarizations is given by gaff- Each incident photon beam with polarization 
tensor (j)'^^{p, Ve) is lost by scattering implying that the loss term must be proportional to (p'^'^ip, Ve)- Just as the loss 
term in Eqn. I(161() is proportional to the same quantity occurring on the left-hand side of the equation, the same is 
true here. In fact this loss term is simply the phase function multiplied by the incident beam, and is thus proportional 
to the probability of a photon scattering from momentum pi to ps- To see this, note that the loss term contains the 
scalar obtained by contracting the projection tensors which are orthogonal to the incident and scattered photons: 

P"^ (pi , Ve)PaP {ps,Ve)=2+ — p,- Ps + ^f' . (165) 

PiPs [PiPsY 

where Pi = -pi-Ve,Ps = -Ps-Ve- In the rest frame of , _pf = p,;(l, n^), p^' =Ps(l,ns), giving prp^ = ~piPs{l~ni-ns), 
and the loss term scalar has the form 

P''''ip^, Ve)Pal3iPs,Ve) ^ 1 + (m ■ n,)^ , (166) 

which is the familiar angular dependence of the differential cross section for Thomson scattering of unpolarized 
radiation. The total loss term is thus simply proportional to the incident beam multiplied by the total cross section 
(in this case, since we have restricted to the Thomson limit, the Thomson cross section). 

Note that the form of Eqn. 1(164(1 guarantees photon number conservation (Compton scattering cannot change the 
overall photon number) : 

3 dp'^{p,V,) 

d p g,,^ = . (167) 



Integration over the delta functions yields the much simplified form of the rest frame kinetic equation 



3 



I l^P'MVe)P''ffip,Ve)r^ip^,Ve) 



r{p,v,) 



(168) 



^ It is important to remember that on the left hand side, the derivative d/dt stands for the operator d/dt + n ■ V, where p*^ = n) in 
a local Lorentz frame. 
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This equation, in conjunction with the transformation to lab frame and integration over electron velocities, discussed 
in the next sections, is sufficient to compute all the Sunyaev-Zeldovich effects. 

At this point we consider the form of Eqn. H168|l in the case where the polarization tensors on the right hand side 
are taken to be unpolarized. This will be used in Paper 11 in the computation of the polarization generated by a single 
scattering of an unpolarized radiation field. In this case the integrand of the gain term in Eqn. H168() is proportional 
to the following combination of projection tensors: 

G^''{p,,Ps,Ve) = P\{ps,Ve)P%{Ps,Ve)P''^{p.,V,) . (169) 

In the electron rest frame, the spatial components of the integrand of the gain term are given by 

= {6'^ - nlnj) - n\n{{\ + (n^ ■ nif\ + {n\n{ + n{n\)(ns ■ rn) . (170) 

This form is convenient for the calculation of the polarization generated by scattering of the CMB quadrupole, in the 
companion paper (and, for example, in 36] ). 

We close this section with a demonstration that the rest frame form of the kinetic equation, Eqn. (|168|l . yields the 
well known results of for the polarized radiative transfer equations in the case of Thomson scattering from cold (i.e. 
stationary) electrons in a slab geometry. Since Chandrasekhar used a different formalism based on transformations 
of the Stokes parameters to derive his expressions, this is an important check of the formalism we have developed. 
These equations also yield the form of the scattering term for the polarization in the primary CMB calculation. 

Consider a plane parallel atmosphere of cold electrons with uniform density filling the half-space z < 0, illuminated 
by an unpolarized beam of monochromatic radiation incident along the normal to the plane (we do not specify the 
boundary conditions here, since we are only interested in deriving the form of the transfer equations and not their 
solution). By symmetry the radiation field in this case is clearly axisymmetric. We work entirely in the electron rest 
frame, where = — 0, and following define r to be the optical depth from the surface z = along the 
downward normal. The Cartesian coordinates in the plane are x and y, and The polar and azimuthal angles about 
the z-axis are denoted 6* and (and /i = cos 6*). 

Then the kinetic equation for the lab frame intensity polarization matrix /(/i, r) is given by Eqn. (|168|l (replacing 
distribution function tensors by intensity tensors, which is trivially allowed here since the beam is monochromatic): 

3 /" ^ /'27r 

fj,drI{n,'p,T) = --P{n) / -^I{fJ,s,(Ps,T)P'^{n) + I{ti,if,T) . 

(171) 

where n{9,ip) = (cos sin 0, sin sin 0, cos 0) is the direction 3- vector of the beam. Note that in this expression, 
following 14], the gain and loss terms have picked up a minus sign since the direction of increasing optical depth 
(along the normal to the boundary z = directed into the half-space z < 0) is defined to be opposite to the polar 
axis. 

In contrast to the Stokes vector approach of 14] , the polarization matrix even in this azimuthally symmetric problem 
has azimuthal dependence. But we are free to exploit the azimuthal symmetry here by choosing a convenient azimuth 
to evaluate the projection factors, and then the result obtained for this azimuth may be transformed into the other 
directions trivially. Choosing (p — 0, we have {n^ ,71^ ,n^) — (^1 — /i^, 0, /i). In the tp ^ direction, has the form 
(since the yx and yz cross-terms must vanish in order that the place of polarization is parallel or perpendicular to 
the y — z plane as required by axisymmetry) . 

/ /zAiVw^N 

/(m,0,t)= /, . (172) 

//(1-m') / 

Here Ii{fi,T) and Ir{fi,T) are the azimuth independent Stokes parameters, parallel and perpendicular respectively 
to the meridian plane, as defined in |14| . The matrices inside the dips integrals range over all values of (fs though, 
so an expression for (^s , <<5s , r) is required. By azimuthal symmetry, this is simply given by rotating J'^'(/Zs, 0, r) 
through an angle ips about the z axis (since under rotation polarization matrices transform according to the vector 
rotation of the electric field strength vectors): 



7(/is,(ps,r) = R{-(psz)Iips,0,T)R^{ipsz) 



(173) 
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where R{(psz) is the matrix which rotates through angle (ps about the ^;-axis 



IlHl COS^ + Ir Sin^ --^iMs sin ip^ cos + cos sin p^ Illls\/' ^ - /If c os ifia 

cos sin + COS (/3s sin (^g /j/Hs sin^ (^s + cos^ </?s —IiiJLg\Jl — n1 sincps 



Iil^s\/i - Ms cos (ps 



Evaluating the I"^ component first, we obtain 

3 



-IilJs\/^ - sin 



where 



Breaking this into three terms for clarity 



^^^r^''ifI,0,T) = fl^ drIl{tl,T) 

dfj,, 

-1 



where 



47r 



{Ii{iJ's,t)i4 cos^ (ps + /r(/"s, t) sin^ ips) 



fj,'^ j dUs {ll{Hs,T)nl+Ir{lJ,s,T)) 



H {I- ^ ) j d^s 



d(ps 
Ait 



J;(/Us,t)(1 - III) 



dUs J h iHs ,r)HsV^- l4 cos ifa 



= . 

Adding the three terms above and dividing by fj,^ gives 

3 



The equation for Ir is obtained similarly from the evolution of the I^^ component, 

iidrpy{fi,o,T) = 

where 



(174) 



(175) 



(176) 



(177) 



(178) 



(180) 



(181) 
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Therefore 



^ J-1 JO 



47r 



(182) 



Eqns. p79|) and (|182|l are the coupled radiative transfer equations for the Stokes parameters for Thomson scattering 
in slab geometry, first obtained by Chandrasekhar [T3|. 

These equations also yield straightforwardly the form of the polarization scattering term in the Boltzmann equation 
for the evolution of the primary CMB anisotropies, in the form given (but not derived) in |13|, as follows. In a frame 
in which the fe-mode considered is aligned with the local 2;-axis, and the observation direction is n = (^1 — /i^, 0, /i), 
the form of the polarization matrix of the CMB is: 



(/-n)^2 





(/-n)Aiv/T^ 





/ + n 

ii~u)^iy/r^ (/-n)(i-/i2) 



(183) 



where we have defined the total intensity I = Ii + Ir and total polarization intensity H = Ii — Ir- The polarization 
magnitude II is obvously independent of ip in this azimuthally symmetrical situation. Using the previous equations 
for // and 7^, we find 

tl ^ U-^{1- fl^) J dfi' [Il{3fi'^ - 1) ~ II + Ir] 

= n-i(i-P2(M)) f^'^[iP2{ti') + i^{P2{p')-P,{m ■ 



(184) 



Following [13, we define 



A, 



With n = Ap, we obtain the standard form of the CMB polarization scattering term, 



D 

D-r' 



-TT 



-Ap + ^ (1 - P2{^J)) (A2 + Ap2 + Apo) 



(185) 



(186) 



(Note T is now conformal time, tt is Thomson optical depth, D /Dt = d/dr + ikfi. The second term in the total 
derivative comes from the other terms in the Liouville equation). 



VII. KLEIN-NISHINA SCATTERING 



Up to now we have worked in the Thomson limit. In this section we extend to the general case of Compton 
scattering with the full Klein-Nishina form of the scattering cross section, and taking into account the electron recoil. 

For now we work still in the rest frame of the electron before scattering (the "initial" electron). To derive the 
polarization tensor kinetic equation our starting point is the Klein-Nishina differential cross-section |23|, in the ini- 
tial electron rest frame, for photons with 3-momentum pi and polarization to scatter into 3-momentum Ps and 
polarization £3, generalized to allow for arbitrary elliptical polarization * |42j : 



da 
dfls 



3ctt / Ps 
8tt \pi 



.*|2 



4 \Pi Ps 



1 



(187) 



* This has been verified |23|l by A. H. Guth, by performing the explicit QED computation with complex polarization vectors using a 
computer algebra system. 
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where pi = E{pi) = \pi\ and ps = E{ps) = \pi\. We allow the polarization vector to be complex in order to treat 
elliptical polarization; the polarization vectors are normalized by ei-e* = e^-e* = 1. The factor (l + |ei-e*p — le^-esp) 
is usually not given as it reduces to unity for linearly polarized light, but we allow for light of arbitrary polarization. 
Eqn. (|187|l assumes the transverse gauge condition Ei Pi = Es Ps — and that the time component of both polarization 
4- vectors vanishes in the initial electron rest frame. The factor {ps/PiY ™ the cross section is a phase space factor. 
Conservation of energy-momentum relates the initial and final momenta and scattering angle: 



Ps_ 

Pi 



— (1 - n,; 



= 1 (1 - rii ■ Us) 

m„ 



(188) 



where rii and are unit three- vectors along the spatial parts of the photon momenta pi = pitii and Ps = Psng. 
Note that for fixed directions, dps/dpi = {Ps/PiY- 

Equations H187(l and (|188|l both assume that all quantities are given in the rest frame of the incident electron. 
We note, incidentally, that the Klein-Nishina formula should be symmetric under the interchange of the initial and 
final states, but the gauge condition which was imposed to derive this form of the cross section required that the 
incident and scattered photon polarization basis 4-vectors be orthogonal to the incident electron, and it thus appears 
rather asymmetric. However, it is true that there is nothing special about this choice of gauge. In Appendix EI it is 
demonstrated that the cross section is manifestly symmetric under interchange of the initial and final states. 

The 4- velocity of the incident electron will from now on be denoted Vi, rather than Vf. (since now we have to 
distinguish between the incident and scattered electron momenta). Now following the procedure in the previous 
section, we write the transition rate of the scattered polarization matrix f'^'^{ps,Vi) as a linear transformation of the 
incident matrix /"^{pijVi), as in Eqns. ()162|l and (|163|l . except we pull the phase space factor out for convenience: 



Vi)d^Ps 



Vz;Pi,Vi) f°'^iPi,Vi) (— ] d-^pidQs 
\Pi 



(189) 



The scattering polarization tensor for the Klein-Nishina cross-section is then given by the modified form: 



3o-T j 
8n I ' 



r \ I ^ I I Pi 



(190) 



where 



(191) 



and the arguments {i) and (s) are abbreviations for the pairs of 4-vcctors {pi,Vi) and {ps,Vi), and ps — ~Vi ■ Ps and 
Pi = ~Vi ■ Pi. As a check, if we consider an unpolarized incident beam with (f>'^^{i) = ^P"^{i) and sum over final 
polarizations, we get the usual spin-summed result for the Klein-Nishina cross section: 



;Pt,uiPs:V,)<i>[:^'l^{ps,v^;p,,v,)P'^'^{pi,Vi) 



16^ 



El 

Pt 



Pi . 2q 

sm 

Ps 



(192) 



where 9 is the scattering angle, cos 6* = • rig. 

Now, as in the derivation in the Thomson limit case, we make an ansatz for the rest frame kinetic equation: 



d_ 



r{p, V,) = nej dn^ J d'p, (^^ j (p„ vf,p,, V,) P\p^,v,) 
X [S^^J'^p S^p, -p)- g^p r%P. V.) S'ip, - p)] . 



(193) 



That this expression is correct can be verified as in the Thomson limit case by substituting the matrices of pure states 
and checking that Eqn. (|161|l is regained with the the Klein-Nishina cross section (I187|l . It is important to understand 
that, as in Eqn. (|161|) . the scattered electron 3- momentum pg in the expression above is not a free variable — it is 
determined by the scattering kinematics as Ps — Ps^s, where Ps is given as a function of Pi and by Eqn. H188(l . 

Having obtained the equation for polarized radiation transfer in the rest frame of the scattering electron, we now 
consider the general case of scattering from a distribution of electrons with varying velocity. To obtain this, first it 
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is necessary to transform the kinetic equation to a common lab frame. Henceforth, components of 4-vectors in the 
rest frame of the initial electron are denoted with primes, and those in the lab frame without primes. The incident 
particles are denoted with a subscript i, and the scattered particle with a subscript s. Photon momenta are denoted 
by p, and electron momenta by q. As usual, quantities with arrows are 4-vectors, and boldface quantities are 3- vectors. 
The 4-momenta satisfy the mass shell conditions Pi ■ Pi = Ps ■ Ps = 0, qi ■ qi = qs • Qs — ^nT-e- The electron energies are 
given by E{q) = + |qr|2. 

In lab frame, the initial and scattering electron 4-momenta are written in terms of the lab frame electron 3- velocities 
as follows: 



rrieVi = 7ime(l, Vi) , 7i 



nieVs = 7sme(l,i's) , 7s 



1 



1 



The Lorentz transformation into the rest frame of the initial electron is given by the matrix: 

A° ^ 7., A° = -7. [v.], a; = 5] + (7, ^ ^) [<K 
This yields the transformation of the photon direction vector between frames: 



n'i = [7^(1 -Vi)]' 



rii 



It 



(194) 



(195) 



(196) 



The relationship between the rest and lab frame momenta of the incident and scattered photons is 

P's = -Ps ■ qi/rue = — [psqi ~ Ps ■ , 
- ^ \ 1 



(197) 



Or in terms of the 7,; factor, 



Pi 



7i (l-rii- Vi) , 

= 7,(l-n, --uO . (198) 

Ps 

The scattered and incident photon energies in the rest frame are related by the familiar Compton scattering formula: 



p\ 



The lab frame version of this is 



1 + (K/"^e) [1 - K • K\ 

Pi(\ - rij ■ Vj) 
s ■Vi + {pihiiTie) (1 - 



(199) 



(200) 



We are now in a position to Lorentz transform Eqn. (|193|) to a lab frame in which the electrons have 3-velocity 
Vi. Four quantities need to be transformed: n^, di', the 3- vector p' of the beam on the left hand side of the master 
equation, and f^^ip^Vi) itself because of its dependence on Vi. Let the four-vector p have spatial components pn in 
the lab frame and p'n' in the electron rest frame. The transformation laws of p and n have already been derived: 



p' = 7ip(l -n-Vi) , 
= [7j(1 - n ■ Vi)Y'^ 

The Lorentz transformation of the electron density is 



X +1 



Vi{n ■ Vi) - jiVi 



(201) 



(202) 
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This is simply due to the Lorentz contraction of the volume element. 

The Lorentz transformation of the time element is more subtle; dt transforms like p, not like n^: 

dt' ^ - n ■ V,) dt . (203) 

From a mathematical point of view, this is because the transport operator on the left-hand side of the Boltzmann 
equation is actually the directional derivative d/dX = {dx^^ / d\)d / dx^^ + {dp^ / dX)d / dp^^ = p^d/dx^^ (in flat space). 
In a local Lorentz frame, d/d\ — pd/dt where d/dt = d/dt + n ■ V. Physically, the transformation of dt arises due 
to the enhancement of the rate of scattering of photons from electrons which are approaching compared to that from 
electrons which are receding, which is due to the dependence of the flux of photons incident on the electrons on their 
relative velocity. Thus we refer to the factor 7^(1 — n • Vi) which appears in Eqn. H203I) as the flux factor. This factor 
is crucial in the derivation of the SZ effects! 

It follows from the transformation of the time element and the invariance of the trace of the distribution function 
tensor, that the left-hand sides of the Boltzmann equation in rest and lab frame are related by ^ 

^ = I m (204) 

dt' 7,(1 -n-nO dt ' ^ ' 

or alternatively 

df{p) _ 1 df'ip) 



dt 7,(1 +n' •■?;,) dt' 



(205) 



(The photon occupation numbers in rest and lab frames also satisfy these equations, of course). 

We now have all the ingredients needed to transform equation (I193|l to the rest frame of an observer with 4- velocity 
vi (the subscript standing for "lab"). The transformation of the polarization tensors from lab to rest frame was 
derived in illVI 

r'"-' {p, V,) = P^'^{p, v,)P'''M vr)r{p, vi) . (206) 

It is convenient to insert the tensors which project into the electron rest frame into the scattering tensor, by redefining 
the tensor P^p{s) which appears in Eqn. H190|) as 

P^;{s■^) EE P^^^{s)P^MP\{^P'p{^ ■ (207) 

where the arguments (i) and (s) are abbreviations for the pairs of 4- vectors (pi, Vi) and (ps, Vi), and Ps = —v-i ■ Ps and 
Pi = -Vi -pi. 

Finally, we can generalize the electron density to a distribution of electrons, rig = ^ d^qige{qi) where qi is the 
electron 3- momentum, and is the scalar phase space distribution function for the electrons. Putting everything 
together, we obtain 



Pj^r\p,vl) = P''^{p,vi)PMp,vi) J -^g,{q,)m,p' 

y- j dn'.j d^p[ <i>;f(p;,z;,;p;,t/.)r^(pi,i/,) 

X [5\5^p SHp's - P') - 5a/3 ^'"'iP^,Vl) S^p', - p')] , (208) 

where E{qi) = ^iUif,. Primes denote components in the rest frame of Vi — qi/nif,, e.g. p' = —Vi ■ p. The flux factor is 
contained in the p' factor in the first line on the right hand side (the Lorentz invariant measure d^qi/ E(qi) is pulled 
out to facilitate the derivation of the covariant form to follow). Note that the projection operators which project 
into the rest frame of Vi are already present in . Similarly, it does not matter whether (j)'^'^ is evaluated in the rest 
frame of Vi or vi, because of the projection operators in front of the integral. It follows that we may, without loss 
of generality, drop the 4-velocity argument from f^"^ and (/i^" provided it is understood that the final results must 
always be projected into the physical polarization space of the observer. 



Recall that p is a Lorentz covariant 4-vector in these expressions. 
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Eqn. H208() looks complicated and is not the most convenient form for computation. The delta functions can be 
integrated over resulting in a simpler expression. This requires the Jacobian of {p'-,n'^) and (p'j,,n^), which follows 
from Eqns. (jl99|l and H2()l|l . yielding: 



dfi- d^p' 



P's 



(209) 



The Boltzmann equation now becomes 

d 



df 



r{p)^ / A..9e(9.)(l 



dVL' 



P's 



r-{p)9o.p^"nPs,v^;p, v,)P\p) 



(210) 



This form is convenient for both analytic and Monte Carlo calculations. The flux factor is explicit in the integration 
over the electron momenta. (Recall that f^'^ must be projected into the observer frame at the end of the calculation.) 

The covariant kinetic equation for Compton scattering was derived for unpolarized photons by '2rf| . Their expression 
for the time evolution of the photon phase space distribution function has the form (Eqn. (2.3) of ,20iJ): 



d f d^qi 



d^ 



92 



d^P2 



\M\^ d^{pi+qi-p2-q2) 



)J E{q2)J E{p2) 

/(P2)(l + f{Pl))9e{q2) - f{pi){l + /(P2))5e(gi) 



(211) 



where |Mp is the squared matrix element for unpolarized Compton scattering. Note that this contains stimulated 
emission factors, which we ignore in our treatment of the polarized case. The integration measures are Lorentz 
invariant: / d^qi/ E{qi) = J d'^qi6[^{qi ■ qi + ml)]. E{q) = (7° is set by the mass shell condition. We complete 
our discussion of the Boltzmann equation by checking that the lab frame kinetic equation can be recast in such a 
manifestly covariant form. We will need the following identities: 



d\i 
E{qi) 
dhi 



d^ 



92 



d^P2 



E{q2)J E{p2) 

dVL2pl f d^q2 



S^iPl +<fl -P2 - 92) 



Eiqi) J {-P2 ■ 92) J E{q2) J {-p2 ■ qi) 



d^l2 P2 



(212) 



These follow from writing the time part of the delta function as 6[E{pi) + E{qi) — E{p2) — E{pi + Qi — P2n2)] or 
5[E{pi) + E{p2n2 + q2— Pi) ~ E{p2) — E{q2)]. To get the forms that we finally need, we replace the denominators of 
the angular integrals using the identities pi ■ qi = P2 ■ 92 and pi ■ q2 — P2 ■ qi, which follow from conservation of total 
4-momentum. 

With these identities, the Boltzmann equation finally takes a manifestly covariant form. 



d^qi r d^q2 f d^p2 
E{qi)] E{q2)J E{p2) 



S'^iPi +91 -P2 -9*2) 



^';s{pi,V2;P2, V2)r\p2)ge{q2) - {pi)go.p'f^^ {p2, vi; p^, v^) r\p^)g4q^) 



(213) 



The integration measures are Lorentz invariant: J d^qi/E{qi) — f d'^qiS[^{qi ■ qi + ml)]. It may be checked that 
working backwards from this equation, integration over the 4-dimensional delta function yields the rest frame form of 
the master equation Eqn. (|208|l . 



APPENDIX A: SYMMETRY OF KLEIN-NISHINA MATRIX ELEMENT 



It is apparent that the square of the invariant amplitude for the Klein-Nishina formula should be symmetric under 
the interchange of the initial and final states, but it is written in a way that is very asymmetric. Here we show that 
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it is possible to write the K-N invariant matrix element in a way that is manifestly symmetric between the initial and 
final states. ^ 

The matrix element for Compton scattering is usually written in an asymmetric way. One can call the initial and 
final electron 4-momenta qi and (fy, and the initial and final photon 4-momenta pi and p/. Using conventions for 
which the Lorentz dot-product is defined as time minus space, one defines pi — pi ■ ql and P2 — Pf ■ Qi- That is, in 
the initial rest frame of the electron, pi and p2 are the initial and final photon energies, multiplied by me, the mass 
of an electron. One also defines polarization vectors for the photons to have zero time-components in this frame, so 

• 9i = 0, e/ • = 0. The differential cross section is then 



7 



da o? ( Pi 



2 



M\ (A.l) 



dU, Ami V-Pi 
where 

M2 = ^ + ^-2 + 4(e; •e.)2 . (A.2) 
Pi P2 ' 

The invariant matrix element should be symmetric under the interchange of initial and final states, z <-> /, but it 
does not look that way. However, it really is. One might think that the non-invariance of the • e*/ term is perhaps 
canceled by the noninvariance of the rest, but it turns out that it is much simpler than that. Each part is separately 
symmetric. To see this, note that conservation of 4-momentum implies that 

Pi+Qi^ Pf + Qf ■ (A.3) 
Squaring both sides, and using the fact that qf — qj and pf — jyj, one has immediately that 

qi-Pi^Qf-Pf , (A.4) 
so in fact pi is invariant under i ^ f . Similarly conservation of 4-momentum implies that 

Qi -Pf = Qf - Pi , (A.5) 



and squaring implies that 



Qi - Pf qf -Pi , (A.6) 



so p2 is invariant under i ^ f . 

The only remaining problem is the {ti ■ e/)^ term, which is not manifestly invariant, since the e's were both defined 
to have vanishing 4th components in the initial rest frame of the electron, so q • = 0, e*/ • = 0. One can use an 
arbitrary gauge for the polarization vectors, however, if one explicitly constructs the gauge transformation satisfying 
e-qi = Q before calculating the dot product. That is, if q does not satisfy Ci ■ ql — 0, then one constructs 

-) ^ ^i ' qi ^ /- A -7\ 

<^i = <^i-^^Pi (A.7) 
Pi ■ qi 

and 

SO • (fi = e'j: ■ qi = 0. To continue, it is useful to define a more compact notation. Let 

Aafj = ea-qp (A. 9) 

Bap = ea-Pf3. (A. 10) 



® This appendix is based on a private communication from A. H. Guth l23l . 
Note that in this Appendix we use the standard Klein-Nishin a c ross section for linearly polarized photons, not the general formula in 
Eqn. 11871 which is valid for circularly polarized photons too |42| . 
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where a and /3 can be cither i or /. Each polarization vector is orthogonal to its corresponding momentum, so 
= Bff = 0. In this notation Eqns. (A. 7) and (A. 8) become 

^, = ei-^Pi (A.11) 
Pi 

and 

So, for polarization vectors Ci and e/ written in an arbitrary gauge, the equation for must be written by replacing 
e, • e*/ with 

. = el • eV - - + ^^p^ ■ pf . (A.13) 

P2 Pi PlP2 

To proceed, we want to use some identities that follow from energy-momentum conservation. Dotting both sides of 
Eqn. (A. 3) with e^, one finds 

Aii = Bif + Aif, (A.14) 

and dotting both sides with e/ (and reversing the sides of the equation) gives 

Aff = Bfi + Afi. (A.15) 

Since one has 6 dot products — An, Aif, Afi, Aff, Bif, and Bfi — and two constraints (Eqns. (A.14) and (A.15)), 
one can eliminate two of the dot products from all expressions. The simplest result seems to arise from eliminating 
the B's. One also needs to simplify pi ■ pf, which can be done by dotting Eqn. (A. 3) with pf. 

P2 +Pi-Pf =Pi , 

so 

Pi ■ Pf = Pi - P2 ■ (A.16) 

Finally, substituting into Eqn. (A.13), 



_ Afii^it - A/) Az(>l// - Af^) AitAft 

Ci - Cf = Ci ■ Cf 1 (Pl -P2) 

P2 Pi PlP2 

= Ci • e/ H . 

P2 Pi 



(A.17) 



Written in this form, the result is manifestly symmetric under the i f interchange, and it is valid for polarization 
vectors ei and e/ written in an arbitrary gauge. One can check that the expression vanishes if Ci is replaced by Pi, or 
if e/ is replaced by pf. 
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